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TUMOR SUPPRESSOR GENE AND METHODS FOR DETECTION OF 
CANCER, MONITORING OF TUMOR PROGRESSION AND 

CANCER TREATMENT 

BACKGROUND OF THE INVENTION 

5 1 . Field of the Invention. 

The invention relates to the detection of genetic abnormalities that confer 
susceptibility to certain cancers in humans. More specifically, the invention 
relates to methods for detecting deletions of, or polymorphisms in, a newlv 
discovered gene which encodes a tumor suppressor. 

10 2. History of the Prior Art. 

In recent years, a growing body of evidence has developed which supports the 
theory that the development of certain tumors is suppressed by gene products 
("tumor suppressors") which inhibit cellular proliferation (see , e.g., the review in 
Marx, Science, 2§&3 19-320, 1994). Conversely, if the tumor suppressors which 

15 would ordinarily be present in a cell are either absent (due, for example, to a 
gene deletion) or less active (due, for example, to a gene mutation), tumor 
growth which would otherwise be inhibited may go unchecked. However, 
although the growth of certain tumors has been positively demonstrated to relate 
to the deletion of a tumor suppressor expressing gene, it has not yet been shown 

20 that mutations in the same genes will allow abnormal cellular proliferation to 
occur. 

The growth cycle of eukaryotic cells is regulated by a family of protein kinases 
known as the cyclin-dependent kinases ("CDK's"). As shown in Figure 1 , the 
cyclins and their associated CDK's move cells through the three phases of the 
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growth cycle (G 1t S and G 2l respectively) leading to division in the mitosis phase 
(M). The cyclin/CDK complexes whose role in cellular proliferation has been 
most clearly defined to date are the cyclin D/CDK enzymes, which are believed 
to assist in the progression of the G A growth cycle phase. Of these enzymes, 
5 cyclin D1 is believed to be an oncogene, whose overexpression stimulates 
excessive cell division through the continuous production of kinase, thus 
contributing to the development of cancers of, for example, the breast and 
esophagus. Cyclin D1 is specifically bound by CDK4 as part of a multi-protein 
complex that also consists of a protein known as p21 and cell nuclear antigen. 

1 0 Known inhibitors of such cyclin/CDK overexpression include the tumor suppres- 
sor protein p53 and the protein product of the retinoblastoma (Rb) gene. 
Recently, another putative inhibitor (p16) was isolated and a cDNA for the 
inhibitor was partially sequenced by Serrano, et a/. ( Nature, 366:704-710, 1993. 
The authors demonstrated that p16 binds CDK4 to inhibit the activity of the 

15 CDK4/cyclin D enzymes. Based on data indicating that p16 prevented 
phosphorylation by CDK/cyclin D of certain Rb growth cycle proteins, the authors 
proposed that p16 acts in vivo upstream and downstream of Rb to form a 
negative feedback loop to regulate cellular proliferation. However, no connection 
between p16 and the occurrence or inhibition of particular cancers was 

20 suggested, nor has any information been published concerning the genomic 
structure of the gene encoding p16. 
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SUMMARY OF THE INVENTION 

Prior to the publication of the Serrano, et a/., article referred to above, the 
inventors discovered a tumor suppressor gene (hereafter, "CDK4r) and identified 
its genomic structure (see . SEQ ID NO's: 1-2). In non-malignant cells, CDK4I 
5 maps to chromosome 9p21 and is physically adjacent to the gene for 
methylthioadenosine phosphorylase (MTAse) ( see . Figure 4(b)). MTAse 
deficiencies resulting from deletions of, or mutations in, the gene for MTAse have 
been shown to be directly related to the onset of certain cancers (see . Nobori, et 
a/., Cancer Res. 53:1098-1101, 1993, the disclosure of which are incorporated 
10 herein for reference regarding the role of MTAse in cancer development, and 
SEQ ID NO: 14, the nucleotide sequence of genomic MTAse). 

Approximately one-half of all tumor cells which have been identified to date as 
either lacking CDK4I or containing mutations or rearrangements (collectively, 
"polymorphisms") of the CDK4I gene also lack MTAse. The inventors have also 
15 identified mutations in the CDK4I gene which are present in the tumor cells of 
patients with certain cancers. The invention is therefore directed to methods to 
detect (a) deletions of the CDK4I gene in cells, and (b) polymorphisms, which 
deletions and polymorphisms are indicative of susceptibility to certain cancers. 

More specifically, in one aspect, the invention comprises methods for detecting 
20 point mutations in, or deletions of, the CDK4I gene. Such methods include 
polymerase chain reaction (PCR) based assays, gel electrophoresis of single- 
strand conformation polymorphisms, direct sequencing, and restriction 
endonuclease digestion. Detection of a deletion of the CDK4I gene will 
preferably be performed by a unique competitive PCR technique. 



i 
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ln another aspect, the invention comprises methods for detection of CDK4I 
proteins and biologically active fragments thereof (collectively, "CDK4r) in a 
biological ceil sample. 

In another aspect, the invention comprises screening protocols for susceptibility 
to particular cancers based on detection of polymorphisms associated with the 
occurrence of the cancers. 



In another aspect, the invention comprises screening protocols for susceptibility 
to particular cancers based on detection of polymorphisms in, or deletions of, the 
genes for both CDK4I and MTAse, as well as detection of deficiencies in the 
1 0 products of the genes. 

In another aspect, the invention comprises genomic CDK4I, expression products 
of the CDK4I gene, CDK41 and fragments thereof, as well as antibodies which 
will specifically bind CDK4I gene expression products, CDK4I and CDK4I 
fragments. 
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FIGURE 8 depicts the results of PCR-based assays for the CDK4I gene in 
several human malignant cell lines. Lane 1=placental cells, lane 2=SK-MEL-31 
(ATCC HTB73; a melanoma cell line), lane 3=WM 266-4 (ATCC CRL 1676; a 
melanoma cell line), lane 4=T98G (a glioma cell line), lane 5=BV173 
5 (ATCC ; a ), lane 6=CEM (ATCC CCL 119; a lympho- 
blastic leukemia cell line), lane 7=MOLT-4 (ATCC 1582; a lymphoblastic 
leukemia), lane 8=A-549 (ATCC CCL 185; a non-small cell lung cancer cell line), 
lane 9=SK-MES-1 (ATCC HTB 58; a non-small cell lung cancer cell line). Lane 
10 has no templates and lane 1 1 has DNA markers. 

10 FIGURE 9 depicts the results of reverse transcriptase PCR-based assays for 
mRNA corresponding to the CDK4I gene in several malignant cell lines. Lane 
1=WIL2-NS (ATCC CRL 8155; a normal lymphoblastoid cell line), lane 2=U937 
(ATCC CRL 1593; a leukemia cell line), lane 3=T98G (ATCC CRL 1690; a glioma 
cell line), lane 4=H661 (ATCC ; a non-small cell lung cancer cell line), lane 

15 5=A-549 (ATCC CCL 185; a non-small cell lung cancer cell line), and lane 6=SK- 
MES-1 (ATCC HTB 58; a non-small cell lung cancer cell line). M=DNA markers. 

FIGURE 10 is the full-length genomic nucleotide sequence for MTAse, with the 
exons underlined. 
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DETAILED DESCRIPTION OF THE INVENTION 

I. IDENTIFICATION AND CHARACTERIZATION OF 
GENOMIC CDK4I 

In the Sequence Listing appended hereto, the full-length genomic nucleotide 
sequence for the human CDK4I gene is set forth at SEQ ID NO's: 1 and 2 (and 
is reproduced in Figure 2 (a-b)). SEQ ID NO's: 3-5 contain the nucleotide 
sequences for the CDK4I gene exons; these exons are underlined in Figure 2 (a- 
b), thus showing the boundaries between the exons (hereafter, "CDK4r\ 
"CDK4l3 m and "CDK4I5 ,M ) as well as introns of the gene. The CDK4I' exon 
contains a palindromic region of 4 inverted repeats which likely contribute to the 
structural stability of the expressed CDK4I protein. Comparison to the reported 
p16 cDNA sequence (Serrano, et a/., Nature, supra) reveals that the reported 
sequence contains regions encoding for E.coli proteins and differs in its 5" region 
from the CDK4I gene by several base pairs, including a single misplaced 
nucleotide which creates a stop codon in the middle of the 5 1 coding region ( see , 
comparison contained in Figure 3; the relevant portions of genomic CDK4I are 
shown along the bottom line while the Serrano, etal., partial sequence (5 1 region) 
is shown along the top line. Differences in the sequences are indicated by the 
absence of vertical connecting lines). 

Genomic CDK4I was identified and characterized as described below. 
The CDK4I gene was believed to reside on chromosome 9p between the loci for 
MTAse and the interferon alpha ("INF-a") gene cluster. This location was 
suggested by the fact that many malignant cell lines with deletions in chromo- 
some 9p either lack MTAse or have hemizygous or homozygous deletions of 
INF-a. In particular, a small 9p deletion identified in the T98G glioma cell line 
(ATCC Accession No. CRL 1690) centromeric to the INF-a loci was focused upon 
as a possible location for CDK4I. 
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As described in greater detail in Example I, the putative location for CDK4I was 
explored with a MTAse cDNA that was used to probe a human placenta lambda 
phage library (SEQ ID NO: 5 contains the genomic nucleotide sequence for 
MTAse; see also . ATCC Accession Nos. 55536-55540). Starting with a 2 
5 kilobase Hind III fragment (MTAse clone 7-2; ATCC Accession No. 55540), 
chromosome walking was performed and t through screening of subsequent 
lambda phage libraries, clones were isolated which encompassed the deleted 
region in T98G cells. The region of chromosome 9p21 between the loci for the 
MTAse gene and the INF-a gene was sequenced focusing on the deleted 
10 segment in T98G; the sequence is contained in Figure 4 (a). 

45 cancer cell lines were screened to determine the frequency of deletions of the 
putative tumor suppressor gene and other sites in region identified in Figure 4(a). 
Data obtained from this assay are shown in Figure 8. Introns from the two most 
frequently deleted sites are identified in Figure 4 (b) as sequence tagged site 

15 (STS) 54F and STS 5BS, which sites are separated by a 50 kilobase region. 
Probes were designed to specifically bind to portions of the 50 kilobase region 
between STS 54F and STS 5BS (SEQ ID NO's: 6-7). The most frequently 
deleted region was identified by a 19 kilobase lambda phage clone (10B1-10) 
( see . Figure 4 (a)). As described in Example I, the CDK4I gene was found to 

20 reside in the region of chromosome 9 which corresponds to clone 1 0B 1-10 
(CDK4I3' and CDK4I') and a related clone 10A1 (CDK4I5'). 

The CDK4I gene is contained in two E.co// strains (containing, respectively, 
10B1-10 and 10A1) on deposit with the American Type Culture Collection 

("ATCC"), deposited on April 14, 1994 under Accession Nos. . 

25 However, no admission that this deposit was necessary to the enablement of this 
disclosure or any of the claims contained herein is made or intended. 
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As shown in Figure 2 and SEQ ID NO's: 3-5, the CDK4I exon of the CDK4I gene 
has a 306 base pair open reading frame, the CDK4I3 1 exon has a short open 
reading frame corresponding to the last 15 base pairs of the coding region for 
CDK4I and the CDK4I5' exon has a 139 base pair open reading frame. 

II. FREQUENCY OF DELETION OF THE CDK4I GENE IN CANCER CELL 
LINES 

Many cancers cluster In families. For example, of approximately 30,000 new 
cases of cutaneous melanomas diagnosed annually in the United States, about 
5-10% originate in a familial setting (see . Cannon-Albright, et a/., Science, 
258:1 148-1 152). The locus for familial melanoma has previously been identified 
as chromosome 9p21 t a region that is reproducibly deleted in sporadic 
melanomas (Fountain, et a/., Proc.Natl.Acad.Sci.USA, 89:10557-10561, 1992). 
In addition, environmental factors, such as exposure to ultraviolet rays and 
cigarette smoking have been identified as major risk factors for the development 
of melanomas in the former case and of lung, bladder, head, neck, and larynx 
cancers. For example, abnormalities of chromosome 9p21 are very common in 
lung cancer cells (Nobori, et a/., Cancer Res. t 53:1098-1101, 1993). 

As described in Example II, to determine whether the CDK4I gene was present 
in, or deleted from, known cancer cell lines, probes corresponding to the CDK4I 
gene were used to rescreen the 45 cancer cell lines referred to above. The 
results of this assay are shown (in a hybridization blot) in Figure 9. For 
reference, probes corresponding to the MTAse, INF-a and INF-b genes, as well 
as the 3.21, 2F t 54F, 71 F, and 3.3B regions on chromosome 9 ( see . Figure 4 (b) 
and Figure 5) were used to screen for the presence of those regions in the same 
cell lines. The complete results of this assay for all gene regions tested are 
tabulated by percentage deletion in Table 1 below, to wit; 61% of melanomas, 
87% of gliomas, 36% of non-small lung cancers and 64% of leukemias were 
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identified as having homozygous deletions of the CDK4I gene. These data 
indicate that human cells contain a single CDK4I gene that is deleted or 
rearranged in the majority of melanomas, gliomas, and leukemias, as well as 
more than a third of non-small cell lung cancers. 
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FREQUENCY AND IDENTITY OF POINT MUTATIONS OF THE CDK4I 
GENE IN TUMOR CELLS 

As discussed in the background section above, the gene encoding the tumor 
suppressor p53 has been found to be deleted in certain cancers, thus allowing 
5 unchecked cellular proliferation to occur. Logically, if a gene encoding a tumor 
suppressor contains a polymorphism that compromises the activity of the 
suppressor, then tumors may develop over time even without deletion of the 
gene encoding the suppressor. In the particular case of the CDK4I gene, its 
presence on chromosome 9p21 suggests that both deletions and polymorphisms 
10 of the gene may contribute to the onset of certain familial and environmental 
cancers. 

More specifically, the role of CDK4I in binding and inhibiting CDK4 indicates that 
an excessive level of kinases can be expected to develop within cells that harbor 
a CDK4I gene deletion or polymorphism that compromises the ability of CDK4I 
15 to inhibit CDK4. Thus, while deletions of the CDK4I gene will be indicative of a 
pre-malignancy or malignancy, polymorphisms in the gene (particularly 
polymorphisms in germline cells of persons with a familial history of 9p21 -linked 
cancers) will be indicative of a susceptibility to develop a "cancer condition 11 (i.e., 
a condition which is causatively related to excessive cellular levels of CDK4). 

20 In its broadest sense, the present invention allows the detection of any 
polymorphism in, or deletion of, a CDK4I target nucleic acid sequence of 
diagnostic or therapeutic relevance, where the target nucleic acid sequence is 
present in a biological cell sample such as that heretofore subjected to 
histopathologic examination using techniques of light microscopy, such as the 

25 margins of a primary tumor or a regional lymph node. Thus, the target nucleotide 
sequence may be, for example, a mutant nucleotide, a restriction fragment length 
polymorphism (RFLP), a nucleotide deletion, a nucleotide substitution, or any 
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* 

other mammalian nucleic acid sequence of interest in such tissue specimens. 
As used herein the term "polymorphism" as applied to a target CDK4I nucleotide 
sequence shall be understood to encompass a mutation, a restriction fragment 
length polymorphism, a nucleic acid deletion, or a nucleic acid substitution. 

5 For example, cells from a human patient who had been diagnosed as suffering 
from familial melanoma (specifically, dysplastic nevus syndrome) were identified 
as containing a nonsense mutation (i.e., a C to T transition) at position 166 of the 
CDK4I mRNA (see, Figure 6 and Example V). In addition, cells from a known 
leukemia cell line (U937; ATCC Accession No. 1593) were screened and found 

10 to contain an intragenic microdeletion of 18 base pairs in the CDK4I5* exon ( see . 
Figure 7 and Example VI). Using the information contained in SEQ ID NO's: 1-2 
and techniques for identifying point mutations in genes which are well-known in 
the art and illustrated herein, those of ordinary skill in the art will be able to 
screen cell samples from particular 9p21 -linked tumors for reproducible 

1 5 polymorphisms and/or deletions of CDK4I to determine genetic susceptibility to, 
as well as the existence of a cancer condition as defined herein (particularly 
melanomas, gliomas, non-small cell lung cancers and ieukemias). 

In the case of deletions and polymorphisms, this information can be used to 
diagnose a pre-cancerous condition or existing cancer condition. Further, by 

20 quantitating the number of cells in successive cell samples which bear and 
acquire the deletion or polymorphism at separate locations in the body and/or 
over time, the progression of a cancer condition can be monitored. Similarly, 
where a deletion or polymorphism is found in a patient who has not yet 
developed symptoms of a cancer condition (particularly one who carries the 

25 abnormality in germline cells and/or has a family history of a particular cancer 
condition), the deletion or polymorphism will be indicative of a genetic susceptibil- 
ity to develop the cancer condition. Such susceptibility can further be evaluated 
on a qualitative basis based on information concerning the prevalence, if any, of 



WO 95/28169 



PCT/US95/04655 



-14- 

the cancer condition in the patient's family history and the presence of other risk 
factors, such as exposure to environmental factors and whether the patient also 
carries cells having a deletion of the gene for MTAse. 



5 



To this end, preferred diagnostic techniques are described below, the use of 
which is illustrated in the Examples provided herein. 
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IV. METHODS FOR DETECTION OF DELETIONS AND POLYMOR- 
PHISMS IN THE CDK41 GENE 

Amplification of the CDK4I gene is generally required to produce detectable 
amounts of any gene present in a biological cell sample; i.e., a fluid or tissue 
5 sample which includes a sample of germline cells (e.g., from blood, skin or hair 
follicles) or somatic cells in a malignant or pre-malignant lesion (e.g., from tissue 
biopsies, sputum or urinary specimens). Following amplification, point mutations 
may be detected by means known to those of ordinary skill in the art such as 
direct sequencing, or oligonucleotide hybridization under conditions that can 

10 detect a single base pair change. Also suitable are the techniques for gel 
electrophoresis of single strand conformation polymorphisms (known in the art 
as "SSCP"; see, e.g., Orita, et a/., Pmc.NatLAcad.ScLUSA, 86:2766-2770, 1989), 
heteroduplex analysis to detect mismatches between double stranded DNA (a 
suitable kit for this protocol is the M MDE Heteroduplex Kit" sold by AT Biochem. 

15 of Malvern, PA.), allele specific PCR ( see , e.g., Wu, et al. t 
Proc.Natl.AcadSci.USA, 86:2757-2760, and restriction fragment length 
polymorphism analysis (known in the art as "RFLP"; see , e.g., Knowlton, et a/., 
Nature, 31S:380-382, 1985). Examples of the application of these techniques to 
detect polymorphisms in the CDK4I gene are provided infra; for further details, 

20 the disclosures of the references referred to in the preceding sentence are 
incorporated herein by this reference. 

Detection of homozygous deletions of the CDK4I gene may be readily detected 
by known PCR techniques, as illustrated further below. However, it is possible 
for a person to be hemizygous for the CDK4I gene, in which case gene dosage 
25 analysis for each exon will be performed. Quantitative PCR techniques known 
in the art may be used to perform this analysis; a preferred technique is 
described below and in Kohsaka, et a/. t Nuc.Acids Res., 21:3469-3472, 1993. 
Examples illustrating the use of the preferred technique to detect point mutations 
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in the CDK4I gene are provided infra] for further reference, the disclosures of the 
Kohsaka, et a/., article and co-pending applications referred to in the preceding 
sentence are incorporated herein by reference. 

The most preferred method for performance of qualitative PCR to detect 
5 deletions and polymorphisms of the CDK4I gene involves use of the PCR-ELISA 
techniques described in infra and in Kohsaka, et a/., supra. Although such PCR- 
ELISA methods are preferred for their sensitivity and simplicity, those of ordinary 
skill in the art will know of, or can readily ascertain, other suitable PCR assays 
(such as are described in "PCR Protocols", Innis, etai, eds., (Academic Press, 
10 1990)). 

A. General Methods for Use in PCR and PCR-based Assays 

When is is desirable to amplify the CDK4I target nucleotide sequence before 
detection, such as a CDK4I nucleotide sequence containing a polymorphism, this 
can be accomplished using oligonucleotide(s) that are primers for amplification. 
15 These unique oligonucleotide primers are based upon identification of the 
flanking regions contiguous with the CDK4I nucleotide sequence containing the 
polymorphism. 

In general, primers for use in PCR-based assays will embrace oligonucleotides 
of sufficient length and appropriate sequence which provides specific initiation 

20 of polymerization of a significant number of nucleic acid molecules containing the 
target nucleic acid under the conditions of stringency for the reaction utilizing the 
primers. In this manner, it is possible to selectively amplify the specific target 
nucleic acid sequence containing the nucleic acid of interest. Specifically, the 
term "primer 11 as used herein refers to a sequence comprising two or more 

25 deoxyribonucleotides or ribonucleotides, preferably at least eight, which 
sequence is capable of initiating synthesis of a primer extension product that is 
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substantially complementary to a target nucleic acid strand. The oligonucleotide 
primer typically contains 15-22 or more nucleotides, although it may contain 
fewer nucleotides as long as the primer is of sufficient specificity to allow 
essentially only the amplification of the specifically desired target nucleotide 
5 sequence (i.e., the primer is substantially complementary). 

Experimental conditions conducive to synthesis include the presence of 
nucleoside triphosphates and an agent for polymerization, such as DNA 
polymerase, and a suitable temperature and pH. The primer is preferably single 
stranded for maximum efficiency in amplification, but may be double stranded. 

10 If double stranded, the primer is first treated to separate its strands before being 
used to prepare extension products. Preferably, the primer is an oligodeoxyribo- 
nucleotide. The primer must be sufficiently long to prime the synthesis of 
extension products in the presence of the inducing agent for polymerization. The 
exact length of primer will depend on many factors, including temperature, buffer, 

15 and nucleotide composition. 

Primers for use in the PCR-based assays of the invention will be designed to be 
"substantially" complementary to each strand of mutant nucleotide sequence to 
be amplified. Substantially complementary means that the primers must be 
sufficiently complementary to hybridize with their respective strands under 
20 conditions which allow the agent for polymerization to function. In other words, 
the primers should have sufficient complementarily with the flanking sequences 
to hybridize therewith and permit amplification of the mutant nucleotide 
sequence. Preferably, the 3* terminus of the primer that is extended has perfectly 
base paired complementarity with the complementary flanking strand. 

25 Oligonucleotide primers used according to the invention are employed in any 
amplification process that produces increased quantities of target nucleic acid. 
Typically, one primer is complementary to the negative (-) strand of the mutant 



WO 95/28169 



PCT/DS95/04655 



-18- 



nucleotide sequence and the other is complementary to the positive (+) strand. 
Annealing the primers to denatured nucleic acid followed by extension with an 
enzyme, such as the large fragment of DNA Polymerase I (Klenow) or Taq DNA 
polymerase and nucleotides or ligases, results in newly synthesized + and - 
5 strands containing the target nucleic acid. Because these newly synthesized 
nucleic acids are also templates, repeated cycles of denaturing, primer 
annealing, and extension results in exponential production of the region (i.e., the 
target mutant nucleotide sequence) defined by the primer. The product of the 
amplification reaction is a discrete nucleic acid duplex with termini corresponding 
10 to the ends of the specific primers employed. Those of skill in the art will know 
of other amplification methodologies which can also be utilized to increase the 
copy number of target nucleic acid. 

The oligonucleotide primers for use in the invention may be prepared using any 
suitable method, such as conventional phosphotriester and phosphodiester 

15 methods or automated embodiments thereof. In one such automated embodi- 
ment, diethylphosphoramidites are used as starting materials and may be 
synthesized as described by Beaucage, et al. (Tetrahedron Letters, 22:1859- 
1862, 1981). One method for synthesizing oligonucleotides on a modified solid 
support is described in U.S. Patent No. 4,458,066. One method of amplification 

20 which can be used according to this invention is the polymerase chain reaction 
(PCR) described in U.S. Patent Nos. 4,683,202 and 4,683,195. 
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The nucleic acid from any biological cell sample, in purified or nonpurified form, 
can be utilized as the starting nucleic acid or acids, provided it contains, or is 
suspected of containing, the specific nucleic acid sequence containing the target 
nucleic acid. Thus, the process may employ, for example, DNA or RNA, 

. 5 including messenger RNA (mRNA), wherein DNA or RNA may be single stranded 
or double stranded. In the event that RNA is to be used as a template, enzymes, 
and/or conditions optimal for reverse transcribing the template to DNA would be 
utilized. In addition, a DNA-RNA hybrid which contains one strand of each may 
be utilized. A mixture of nucleic acids may also be employed, or the nucleic 

10 acids produced in a previous amplification reaction herein, using the same or 
different primers may be so utilized. The mutant nucleotide sequence to be 
amplified may be a fraction of a larger molecule or can be present initially as a 
discrete molecule, such that the specific sequence constitutes the entire nucleic 
acid. It is not necessary that the sequence to be amplified be present initially in 

15 a pure form; it may be a minor fraction of a complex mixture, such as contained 
in whole human DNA. 

Where the target neoplastic nucleotide sequence of the sample contains two 
strands, it is necessary to separate the strands of the nucleic acid before it can 
be used as the template. Strand separation can be effected either as a separate 

20 step or simultaneously with the synthesis of the primer extension products. This 
strand separation can be accomplished using various suitable denaturing 
conditions, including physical, chemical, or enzymatic means; the word 
"denaturing" includes all such means. One physical method of separating nucleic 
acid strands involves heating the nucleic acid until it is denatured. Typical heat 

25 denaturation may involve temperatures ranging from about 80° to 105°C for 
times ranging from about 1 to 10 minutes. Strand separation may also be 
induced by an enzyme from the class of enzymes known as helicases or by the 
enzyme RecA, which has helicase activity, and in the presence of riboATP which 
is known to denature DNA. The reaction conditions suitable for strand separation 
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of nucleic acids with helicases are described by Kuhn Hoffmann-Berling (CSH- 
Quantitative Biology, 43:63, 1978) and techniques for using RecA are reviewed 
in C. Radding (Ann. Rev. Genetics, 16:405-437, 1982). 

If the nucleic acid containing the target nucleic acid to be amplified is single 
5 stranded, its complement is synthesized by adding one or two oligonucleotide 
primers. If a single primer is utilized, a primer extension product is synthesized 
in the presence of primer, an agent for polymerization, and the four nucleoside 
triphosphates described below. The product will be complementary to the 
single-stranded nucleic acid and will hybridize with a single-stranded nucleic acid 
10 to form a duplex of unequal length strands that may then be separated into single 
strands to produce two single separated complementary strands. Alternatively, 
two primers may be added to the single-stranded nucleic acid and the reaction 
carried out as described. 

When complementary strands of nucleic acid or acids are separated, regardless 
15 of whether the nucleic acid was originally double or single stranded, the 
separated strands are ready to be used as a template for the synthesis of 
additional nucleic acid strands. This synthesis is performed under conditions 
allowing hybridization of primers to templates. Generally synthesis occurs in a 
buffered aqueous solution, preferably at a pH of 7-9, most preferably about 8. 
20 Preferably, a molar excess (for genomic nucleic acid, usually about 10 8 :1 
primentemplate) of the two oligonucleotide primers is added to the buffer 
containing the separated template strands. It is understood, however, that the 
amount of complementary strand may not be known if the process of the 
invention is used for diagnostic applications, so that the amount of primer relative 
25 to the amount of complementary strand cannot be determined with certainty. As 
a practical matter, however, the amount of primer added will generally be in 
molar excess over the amount of complementary strand (template) when the 
sequence to be amplified is contained in a mixture of complicated long-chain 
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nucieic acid strands. A large molar excess is preferred to improve the efficiency 
of the process. 

In some amplification embodiments, the substrates, for example, the 
deoxyribonucleotide triphosphates dATP, dCTP, dGTP, and dTTP, are added to 
5 the synthesis mixture, either separately or together with the primers, in adequate 
amounts and the resulting solution is heated to about 90°-100°C from about 1 to 
10 minutes, preferably from 1 to 4 minutes. After this heating period, the solution 
is allowed to cool to room temperature, which is preferable for the primer 
hybridization. To the cooled mixture is added an appropriate agent for effecting 

10 the primer extension reaction (called herein "agent for polymerization"), and the 
reaction is allowed to occur under conditions known in the art. The agent for 
polymerization may also be added together with the other reagents if it is heat 
stable. This synthesis (or amplification) reaction may occur at room temperature 
up to a temperature above which the agent for polymerization no longer 

15 functions. Thus, for example, if DNA polymerase is used as the agent, the 
temperature is generally no greater than about 40°C. 

The agent for polymerization may be any compound or system which will function 
to accomplish the synthesis of primer extension products, including enzymes. 
Suitable enzymes for this purpose include, for example, E. coli DNA polymerase 

20 I, Taq polymerase, Klenow fragment of E. coli DNA polymerase I, T4 DNA 
polymerase, other available DNA polymerases, polymerase muteins, reverse 
transcriptase, ligase, and other enzymes, including heat-stable enzymes (i.e., 
those enzymes which perform primer extension after being subjected to 
temperatures sufficiently elevated to cause den atu ration). Suitable enzymes will 

25 facilitate combination of the nucleotides in the proper manner to form the primer 
extension products which are complementary to each mutant nucleotide strand. 
Generally, the synthesis will be initiated at the 3' end of each primer and proceed 
in the 5' direction along the template strand, until synthesis terminates, producing 
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molecules of different lengths. There may be agents for polymerization, 
however, which initiate synthesis at the 5' end and proceed in the other direction, 
using the same process as described above. In any event, the method of the 
invention is not to be limited to the embodiments of amplification which are 
5 described herein. 

The newly synthesized mutant nucleotide strand and its complementary nucleic 
acid strand will form a double-stranded molecule under hybridizing conditions 
described above and this hybrid is used in subsequent steps of the process, in 
the next step, the newly synthesized double-stranded molecule is subjected to 
10 denaturing conditions using any of the procedures described above to provide 
single-stranded molecules. 

The above process is repeated on the single-stranded molecules. Additional 
agent for polymerization, nucleosides, and primers may be added, if necessary, 
for the reaction to proceed under the conditions prescribed above. Again, the 
15 synthesis will be initiated at one end of each of the oligonucleotide primers and 
will proceed along the single strands of the template to produce additional nucleic 
acid. After this step, half of the extension product will consist of the specific 
nucleic acid sequence bounded by the two primers. 

Xhe steps of denaturing and extension product synthesis can be repeated as 
20 often as needed to amplify the target mutant nucleotide sequence to the extent 
necessary for detection. The amount of the mutant nucleotide sequence 
produced will accumulate in an exponential fashion. 



WO 95/28169 



PCT/US95/04655 



-23- 

The amplified product may be detected by Southern blot analysis, without using 
radioactive probes. In such a process, for example, a small sample of DNA 
containing a very low level of mutant nucleotide sequence is amplified, and 
analyzed via a Southern blotting technique. The use of non-radioactive probes 
5 or labels is facilitated by the high level of the amplified signal. 

Nucleic acids having a mutation detected in the method of the invention can be 
further evaluated, detected, cloned, sequenced, and the like, either in solution 
or after binding to a solid support, by any method usually applied to the detection 
of a specific DNA sequence such as PCR, oligomer restriction (Saiki, et aL, 
10 Bio/Technology, 3:1008-1012, 1985), allele-specific oligonucleotide (ASO) probe 
analysis (Conner, et aL, Proc. Natl. Acad. Sci. USA, £0:278, 1983), 
oligonucleotide ligation assays (OLAs) (Landegren, et a/., Science, 241 :1077. 
1 988), and the like. Molecular techniques for DNA analysis have been reviewed 
(Landegren, etaL, Science, 242:229-237. 1988). 

15 B. Hybridization with Labelled Probes 

In another diagnostic method of the invention, purified nucleic acid fragments 
containing intervening sequences or oligonucleotide sequences of 10-50 base 
pairs are radioactively labelled. The labelled preparations are used to probe 
nucleic acid from a biological cell sample by the Southern hybridization 

20 technique. Nucleotide fragments from a biological cell sample, before or after 
amplification, are separated into fragments of different molecular masses by gel 
electrophoresis and transferred to filters that bind nucleic acid. After exposure 
to the labelled probe, which will hybridize to nucleotide fragments containing 
target nucleic acid sequences, binding of the radioactive probe to target nucleic 

25 acid fragments is identified by autoradiography (see Genetic Engineering, 1, ed. 
Robert Williamson, Academic Press, (1981), 72-81). Alternatively, nucleic acid 
from the sample can be bound directly to filters to which the radioactive probe 
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selectively attaches by binding nucleic acids having the sequence of interest. 
Specific sequences and the degree of binding is quantitated by directly counting 
the radioactive emissions. 

Where the target nucleic acid is not amplified, detection using an appropriate 
5 hybridization probe may be performed directly on the separated mammalian 
nucleic acid. In those instances where the target nucleic acid is amplified, 
detection with the appropriate hybridization probe would be performed after 
amplification. 

The probes of the present invention can be used for examining the distribution 
10 of the specific fragments detected, as well as the quantitative (relative) degree 
of binding of the probe for determining the occurrence of specific strongly binding 
(hybridizing) sequences, thus indicating the likelihood for an individual to be at 
low risk or high risk for a cancer condition, such as familial melanoma. 

For the most part, the probe will be detectably labelled with an atom or inorganic 
15 radical, most commonly using radionuclides, but also heavy metals can be used. 
Conveniently, a radioactive label may be employed. Radioactive labels include 
32p 125| a H 14 C m| n 9omj c or the like. Any radioactive label may be employed 
which provides for an adequate signal and has sufficient half-life. Other labels 
include ligands, which can serve as a specific binding pair member for a labelled 
20 ligand, and the like. A wide variety of labels routinely employed in 
immunoassays can readily be employed in the present assay. 

The choice of the label will be governed by the effect of the label on the rate of 
hybridization and binding of the probe to the target nucleotide sequence. It will 
be necessary that the label provide sufficient sensitivity to detect the amount of 
25 target nucleotide sequence available for hybridization. Other considerations will 
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be ease of synthesis of the probe, availability of instrumentation, ability to 
automate, convenience, and the like. 

The manner in which the label is bound to the probe will vary depending upon the 
nature of the label. For a radioactive label, a wide variety of techniques can be 
5 employed. Commonly employed is nick translation with an a 32 P-dNTP or 
terminal phosphate hydrolysis with alkaline phosphatase followed by labeling with 
radioactive M P employing ^P-NTP and T4 polynucleotide kinase. Alternatively, 
nucleotides can be synthesized where one or more of the elements present are 
replaced with a radioactive isotope, e.g., hydrogen with tritium. If desired, 
10 complementary labelled strands can be used as probes to enhance the 
concentration of hybridized label. 

Where other radionucleotide labels are involved, various linking groups can be 
employed. A terminal hydroxyl can be esterified, with inorganic acids, e.g., 32 P 
phosphate, or 14 C organic acids, or else esterified to provide linking groups to the 
15 label. Alternatively, intermediate bases may be substituted with activatable 
linking groups that can then be linked to a label. 

Enzymes of interest as reporter groups will primarily be hydrolases, particularly 
esterases and glycosidases, or oxidoreductases, particularly peroxidases. 
Fluorescent compounds include fluorescein and its derivatives, rhodamine and 
20 its derivatives, dansyl, umbelliferone, and so forth. Chemiluminescers include 
luciferin, and 2, 3-dihydrophthalazinediones (e.g., luminol). 

The probe can be employed for hybridizing to a nucleotide sequence affixed to 
a water insoluble porous support. Depending upon the source of the nucleic 
acid, the manner in which the nucleic acid is affixed to the support may vary. 
25 Those of ordinary skill in the art know, or can easily ascertain, different supports 
that can be used in the method of the invention. 
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The nucleic acid from a biological cell sample is cloned and then spotted or 
spread onto a filter to provide a plurality of individual portions (plaques). The 
fitter is an inert porous solid support, e.g., nitrocellulose. Any cells (or phage) 
present in the specimen are treated to liberate their nucleic acid. The lysing and 
5 denaturation of nucleic acid, as well as the subsequent washings, can be 
achieved with an appropriate solution for a sufficient time to lyse the cells and 
denature the nucleic acid. For lysing, chemical lysing will conveniently be 
employed, as described previously for the lysis buffer. Other denaturation agents 
include elevated temperatures, organic reagents, e.g., alcohols, amides, amines, 
10 ureas, phenols and sulfoxides or certain inorganic ions, e.g., thiocyanate and 
perch I orate. 

After denaturation, the filter is washed in an aqueous buffered solution, such as 
Tris, generally at a pH of about 6 to 8, usually 7. One or more washings may be 
involved, conveniently using the same procedure as employed for the lysing and 
15 denaturation. After the lysing, denaturing, and washes have been accomplished, 
the nucleic acid spotted filter is dried at an elevated temperature, generally from 
about 50°C to 70°C. Under this procedure, the nucleic acid is fixed in position 
and can be assayed with the probe when convenient 

Pre-hybridization may be accomplished by incubating the filter with the 
20 hybridization solution without the probe at a mildly elevated temperature for a 
sufficient time to thoroughly wet the filter. Various hybridization solutions may be 
employed, comprising from about 20% to 60% volume, preferably 30%, of an 
inert polar organic solvent. A common hybridization solution employs about 50% 
formamide, about 0.5 to 1M sodium chloride, about 0.05 to 0.1 M sodium citrate, 
25 about 0.05 to 0.2% sodium dodecylsulfate, and minor amounts of EDTA, ficoll 
(about 300-500 kD), polyvinylpyrrolidone, (about 250-500 kD) and serum 
albumin. Also included in the hybridization solution will generally be from about 
0.5 to 5 mg/ml of sonicated denatured DNA, e.g., calf thymus of salmon sperm; 
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and optionally from about 0.5 to 2% wt/vol glycine. Other additives may also be 
included, such as dextran sulfate of from about 100 to 1,000 kD and in an 
amount of from about 8 to 15 weight percent of the hybridization solution. 

The particular hybridization technique is not essential to the invention. Other 
5 hybridization techniques are described by Gall and Pardue, (Proc. Natl. Acad. 
Sci. §3:378, 1969); and John, et a/., (Nature, 223:582, 1969). As improvements 
are made in hybridization techniques they can readily be applied in the method 
of the invention. 

The amount of labelled probe present in the hybridization solution will vary 
10 widely, depending upon the nature of the label, the amount of the labelled probe 
that can reasonably bind to the filter, and the stringency of the hybridization. 
Generally, substantial excess over stoichiometric concentrations of the probe will 
be employed to enhance the rate of binding of the probe to the fixed target 
nucleic acid. 

15 Various degrees of stringency of hybridization may be employed. The more 
severe the conditions, the greater the complementarity that is required for 
hybridization between the probe and the single stranded target nucleic acid 
sequence for duplex formation. Severity can be controlled by temperature, probe 
concentration, probe length, ionic strength, time, and the like. Conveniently, the 

20 stringency of hybridization is varied by changing the polarity of the reactant 
solution by manipulating the concentration of formamide in the range of 20% to 
50%. Temperatures employed will normally be in the range of about 20°C to 
80°C, usually 30°C to 75°C (see, generally, Current Protocols in Molecular 
Biology, Ausubel, ed., Wiley & Sons, 1989). 

25 After the filter has been contacted with a hybridization solution at a moderate 
temperature for a period of time sufficient to allow hybridization to occur, the filter 



WO 95/28169 



PCT/US95/04655 



-28- 

is then introduced into a second solution having analogous concentrations of 
sodium chloride, sodium citrate and sodium dodecylsulfate as provided in the 
hybridization solution. The time the filter is maintained in the second solution 
may vary from five minutes to three hours or more. The second solution 
5 determines the stringency, dissolving cross duplexes and short complementary 
sequences. After rinsing the filter at room temperature with dilute sodium citrate- 
sodium chloride solution, the filter may now be assayed for the presence of 
duplexes in accordance with the nature of the label. Where the label is 
radioactive, the filter is dried and exposed to X-ray film. 

10 The label may also comprise a fluorescent moiety that can then be probed with 
a specific antifluorescent antibody. For example, horseradish peroxidase 
enzyme can be conjugated to this antibody to catalyze a chemiluminescent 
reaction. Production of light can then be seen on rapid exposure to film. 

C. Preferred. Competitive PCR-based Assays 

15 The preferred method for performance of quantitative PCR in the invention is a 
competitive PCR technique performed using a competitor template containing an 
induced mutation of one or more base pairs which results in the competitor 
differing in sequence (but not size) from the target CDK4I gene template. One 
of the primers is biotinylated or, preferably, aminated so that one strand (usually 

20 the antisense strand) of the resulting PCR product can be immobilized via an 
amino-carboxyi, amino-amino, biotin-streptavidin or other suitably tight bond to 
a solid phase support which has been tightly bound to an appropriate reactant. 
Most preferably, the bonds between the PCR product, solid phase support and 
reactant will be covalent ones, thus reliably rendering the bonds resistant to 

25 uncoupling under denaturing conditions. 



I 
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Once the aminated or biotinylated strands of the PCR products are immobilized, 
the unbound complementary strands are separated in an alkaline denaturing 
wash and removed from the reaction environment. Sequence-specific 
oligonucleotides ("SSO's") corresponding to the target and competitor nucleic 
5 acids are labelled with a detection tag. The SSO's are then hybridized to the 
antisense strands in absence of competition from the removed unbound sense 
strands. Appropriate assay reagents are added and the degree of hybridization 
is measured by ELISA measurement means appropriate to the detection tag and 
solid phase support means used, preferably an ELISA microplate reader. The 
1 0 measured values are compared to derive target nucleic acid content, using a 
standard curve separately derived from PCR reactions amplifying templates 
including target and competitor templates. 

This method is advantageous in that it is quantitative, does not depend upon the 
number of PCR cycles, and is not influenced by competition between the SSO 
1 5 probe and the complementary strand in the PCR product. 

Alternatively, part of the polymerization step and all of the hybridization step can 
be performed on a solid phase support. In this method, it is an nucleotide 
polymerization primer (preferably an oligonucleotide) which is captured onto a 
solid phase support rather than a strand of the PCR products. Target and 
20 competitor nucleic acid PCR products are then added in solution to the solid 
phase support and a polymerization step is performed. The unbound sense 
strands of the polymerization product are removed under the denaturing 
conditions described above. 

A target to competitor nucleic acid ratio can be determined by detection of 
25 labelled oligonucleotide SSO probes using appropriate measurement means 
(preferably ELISA readers) and standard curve as described supra. The 
efficiency of this method can be so great that a chain reaction in the 
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polymerization step may be unnecessary, thus shortening the time needed to 
perform the method. The accuracy of the method is also enhanced because the 
final polymerization products do not have to be transferred from a reaction tube 
to a solid phase support for hybridization, thus limiting the potential for their loss 
5 or damage. If necessary for a particular sample, however, the PCR may be used 
to amplify the target and competitor nucleic acids in a separate reaction tube, 
followed by a final polymerization performed on the solid phase support. 

An additional alternative to the above described techniques performs the 
polymerization step in a single step on a solid phase support. In this method, the 

10 PCR is performed to amplify the target (and where a quantitative analysis is 
desired, the competitor) nucleic acid on a solid phase support. Before the PCR 
is performed, primers (which correspond to the target and competitor nucleic 
acids) are tightly bound to the solid phase support. Two additional primers are 
placed into solution with the target nucleic acid (or three primers where a 

15 competitive template is present). 

As the PCR begins, the templates do not interact with the bound primer to a 
substantial degree because template concentration is relatively low and the 
bound primer is not readily accessible. However, as the templates are amplified, 
more of the PCR products become bound to the solid phase via hybridization 
20 with the bound primer. In essence, therefore, the bound primers serve as 
hybridization probes for the PCR products formed by priming of the target and 
competitor nucleic acids. Once hybridization occurs, the hybridization primer 
elongates via the PCR. 

Molecules capable of providing different, detectible signals indicative of the 
25 formation of bound PCR products known to those skilled in the art (such as the 
labels described supra as well as labelled nucleotide chromophores which will 
form different colors indicative of the formation of target and competitor PCR 
products) can be added to the reaction solution during the last few cycles of the 
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reaction. The ratio between the target and competitor nucleic acids can also be 
determined by ELISA or other appropriate measurement means and reagents 
reactive with detection tags coupled to the 3' end of the immobilized hybridization 
primers. This method may also be adapted to detect whether a particular gene 
5 is present in the sample (without quantifying it) by performing a conventional 
noncompetitive PCR protocol. 

Those of ordinary skill in the art will know, or may readily ascertain, how to select 
suitable primers for use in the above methods. For example, primers which will 
amplify the CDK4I gene and correspond to the CDK4P, CDK4I3' and CDK4I5* 
10 exons are described in SEQ.ID.Nos.8-13. 

D. Sinale-stran d Conformation Polymorphism Analysis 

Techniques to detect DNA polymorphisms based on restriction fragment length 
polymorphism analysis (RFLP) and electrophoresis gel mobility shifts caused by 
single nucleotide substitution in single-stranded DNA (SSCP) have proved to be 

15 useful methods for distinguishing allelic variations at chromosomal loci. For 
example, RFLP has been used to detect genetic abnormalities present in cystic 
fibrosis and other hereditary disorders ( see , e.g., Knowlton, et a/., Nature, 
318:380-382 [re use of RFLP to detect cystic fibrosis], and Shiraishi, et a/., 
JpnJ.CancerRes., 78:1302-1 308, 1987 [re performance of RFLP generally], the 

20 disclosures of which are incorporated herein by this reference to illustrate 
knowledge in the art concerning the use of RFLP). However, RFLP requires that 
the polymorphisms of interest be present in the recognition sequences for the 
corresponding restriction endonucleases or when deletion or insertion of a short 
sequence is present in the region detected by a particular probe. Therefore, 

25 SSCP is a preferred technique for detection of allele-specific polymorphisms. 
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The technique for performance of SSCP is well-known in the art (see , e.g., Orita, 
et a/., Genetics, 86:2766-2770, 1989, the disclosure of which is incorporated 
herein by this reference to illustrate knowledge in the art concerning the use of 
SSCP). Generally, gene fragments or alleles of interest are denatured and 
5 subjected to electrophoresis in a neutral polyacrylamide gel. Single-stranded 
DNA's (or RNA copies thereof) are transferred to a membrane (by blotting) and 
hybridized with detectably labelled DNA probes for the fragments/alleles of 
interest. The relative speed in which the fragments/alleles of interest move in the 
gel ("mobility shift") is indicative of the presence or absence of base substitutions. 

10 A particularly suitable SSCP technique is one which uses the PCR is used to 
simultaneously amplify the target sequence and label it with a radioisotope or, 
preferably, a fluorescein molecule (using labelled primers in the PCR; i.e., "F- 
PCR-SSCP"). Most preferably, detection of bands of DNA in a polyacrylamide 
gel will be performed with an automatic DNA sequencer, which permits strict 

15 control of the gel at any desired temperature and allows for quantitative 
interpretation of the resulting data (based on the proportionality of the heights of 
the peaks in the fluorogram to the intensity of the fluorescence emitted by the 
labelled DNA). For a summary of the known method for performance of F-PCR- 
SSCP, those of skill in the art may wish to consult Makino, et at., PCR Methods 

20 andApplns., 2:10-13 (Cold Spring Harbor Lab., 1992), the disclosure of which 
is incorporated herein by this reference to illustrate knowledge in the art 
concerning F-PCR-SSCP. 

E. Allele-spec ific Enzvmatic Amplification of Genomic DNA 

A simple, and therefore preferred, method of detecting polymorphisms in 
25 genomic DNA is a technique which is based on a allele-specific PCR (ASPCR). 
In ASPCR, two allele-specific oligonucleotide primers (such as those described 
in SEQ ID NO's: 8-13), one of which is specific for the suspected and/or known 
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mutated allele, the other of which is specific for the "normal" allele, are used in 
the PCR with genomic DNA templates and another primer which is 
complementary to both alleles. Under the proper annealing temperature and 
PCR conditions, the primers will only direct amplification of their complementary 
5 allele, thus allowing for the determination of genotypes in nucleic acid samples 
obtained from human tissue. More particularly, suitable temperatures for this 
PCR are about 55°C for the annealing cycles, about 72°C for the polymerization 
cycles, and about 94°C for the heat-denaturation cycles. 

For further details concerning performance of the ASPCR, those of skill in the art 
10 may wish to consult Wu, et a/., Proc. Natl. Acad. ScL USA, 86:2757-2760, 1989, 
the disclosure of which is incorporated herein by this reference. 

F. Indirect Detection of Gene Deletions Based on 

the Absence of CDK4I In a Biol ogical Cell Sample 

In a normal, non-malignant cell, CDK4I can be expected to be present, usually 
15 in bound form; i.e., in a complex of CDK4I, CDK4, cyclin D and other molecules, 
such as a cell nuclear antigen. Methods for indirect detection of a deletion of the 
gene for CDK4I based on the absence of the CDK4 protein (as determined by, 
preferably, immunoassay) are described in further detail below at Section VIII. 

V. ISOLATION AND PURIFICATION OF CDK4I 

20 The term "substantially pure" as used herein denotes a protein which is substan- 
tially free of other compounds with which it may normally be associated in vivo. 
In the context of the invention, the term refers to homogenous CDK4I, which 
homogenicity is determined by reference to purity standards known to those of 
ordinary skill in the art (e.g., purity sufficient to allow the N-terminal amino acid 

25 sequence of the protein to be obtained). 
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Substantiaily pure CDK4I may be obtained from tissue homogenates (containing 
"normal" cells; i.e., those cells which contain the CDK4I gene), through microbial 
expression, by synthesis, or by purification means known to those skilled in the 
art, such as affinity chromatography. Such techniques may be utilized to obtain 
5 biologically active peptide fragments of CDK4I. In this context, "biologically 
active peptide fragments" refers to fragments which contain a binding domain for 
CDK4. 



Determination that a CDK4I fragment contains a CDK4 binding domain may be 
made by use of any of several methods known to those skilled in the art, 

10 including determination of the binding kinetics and affinity of the fragment for 
CDK4 as well as inhibition studies using anti-CDK4 antibodies ( see , e.g., Xiong, 
et a/., Genes Dev., 7:1572-1583, 1993, the disclosure of which is incorporated 
herein by this reference to illustrate a standard method for production of anti- 
CDK4 antibodies; other suitable methods for antibody production which may be 

15 adapted to produce anti-CDK4 antibodies are described infra). 
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Minor modifications of the primary amino acid sequence of CDK4I (which may be 
readily derived from SEQ.ID.Nos. 1-2) may result in variants which have substan- 
tially equivalent activity as compared to the specific CDK4I protein described 
herein. Such modifications may be deliberate, as by site-directed mutagenesis, 
5 or may be spontaneous. All of the variants produced by these modifications are 
included herein as long as biological activity present in the original protein still 
exists. For purposes of this disclosure, such variants shall be generally 
considered to be 'functional variants". Functional amino acid sequence variants 
of CDK4I may fall into one or more of three classes; substitutional, insertional or 
10 deletionai variants. Such variants ordinarily are prepared by site-specific 
mutagenesis of nucleotides in the DNA encoding CDK4I hereby producing DNA 
encoding the variant, and thereafter expressing the DNA in recombinant cell 
culture. However, variant CDK4I and CDK4I fragments having up to about 100- 
150 residues may be conveniently prepared by in vitro synthesis. 

Amino acid sequence variants are ordinarily characterized by the intended nature 
of the variation, but such variants also include naturally occurring allelic or 
interspecies variation of the CDK4I amino acid sequence. The variants typically 
exhibit the same qualitative biological activity as the naturally-occurring analogue, 
although variants may also be selected in order to modify the characteristics of 
CDK4I as will be more fully described below. 

While the site for introducing an amino acid sequence variation is predetermined, 
the mutation per se need not be predetermined. For example, in order to 
optimize the performance of a mutation at a given site, random mutagenesis may 
be directed at the target codon or region and the expressed CDK4I variants 
25 screened for the optimal combination of desired activity. Techniques for making 
substitution mutations at particular sites in DNA having a known sequence are 
well known, for example M13 primer mutagenesis. Amino acid substitutions are 
typically of single residues; insertions usually will be on the order of about from 
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1 to 10 amino acid residues; and deletions will usually range about from 1 to 30 
residues. Deletions or insertions preferably are made in adjacent pairs, i.e., a 
deletion of 2 residues or insertion of 2 residues. 

Substitutions, deletions, insertions or any combination thereof may be combined 
5 to arrive at a final construct. Obviously, the mutations that will be made in the 
DNA encoding the variant CDK4I must not place the sequence out of reading 
frame ( see . SEQ.ID.Nos: 1-2). 

Substitutional variants are those in which at least one residue in SEQ ID No. 2 
has been removed and a different residue inserted in its place. These may be 

1 0 made to eliminate glycosylation sites in the sequence, to alter the pH, to increase 
the stability of the protein, or to accomplish other desirable modifications in the 
protein, which modifications will be apparent to those of ordinary skill in the art. 
For example, oxidative stability of CDK4I may be achieved by deletion of cysteine 
or other labile residues. Deletion or substitution of potential proteolysis sites can 

15 also be accomplished by deleting such residues or substituting a glutaminyl or 
histidyl residue. 

Insertional amino acid sequence variants of CDK4I are those in which one or 
more amino acid residues are introduce into a predetermined site in the target 
receptor. Most commonly, insertional variants are fusions of heterologous 

20 proteins or polypeptides to the amino or carboxyl terminus of the protein to be 
varied. For example, immunogenic CDK4I derivatives may be made by fusing an 
immunogenic polypeptide to the target sequence by cross-linking in vitro or by 
recombinant cell culture transformed with DNA encoding the fusion. Such 
immunogenic polypeptides preferably are bacterial polypeptides such as trpLE, 

25 beta-galactosidase and the like, together with their immunogenic fragments. 

CDK4I of the invention also includes amino acid sequence mutants, glycosylation 
variants and covalent or aggregative conjugates with other chemical moieties. 
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Covalent derivatives of CDK4I may also be prepared by linkage of functional 
moieties to groups which are found in the receptor's amino acid side chains or 
at the N, or C-termini, by means known in the art. These derivatives may, for 
example, include aliphatic esters or amides of the carboxyl terminus or residues 
- 5 containing carboxyl side chains, O-acyl derivatives of hydroxyl group-containing 
residues, and N-acyl derivatives of the amino terminal amino acid or amino-group 
containing residues, e.g. lysine or arginine. 

Another group of derivatives are covalent conjugates of CDK4I and CDK4I 
fragments with other proteins or polypeptides. These derivatives may be 

10 synthesized by one of ordinary skill in the art in recombinant culture as N, or C- 
terminal fusions or by the use of dysfunctional agents known per se for use in 
cross-linking proteins to insoluble matrices through reactive side-groups. 
Covalent or aggregative derivatives will be useful as immunogens, reagents in 
immunoassay or for affinity purification of CDK4I. For example, CDK4I 

1 5 insolubilized by covalent bonding to cyanogen bromide-activated "SEPHAROSE" 
(agarose tradename) by known methods or adsorbed to polyoefin surfaces may 
be used in an assay or in purification of anti-CDK4l antibodies or CDK4I ligand. 



With reference to SEQ.ID.Nos: 1-2, CDK4I protein and peptides can be identified 
and synthesized by such commonly used methods as t-BOC or FMOC protection 

20 of alpha-amino groups. Both methods involve stepwise syntheses whereby a 
single amino acid is added at each step starting from the C terminus of the 
peptide ( see . Coligan, et aL, Current Protocols in Immunology, Wiley 
Interscience, 1991, Unit 9). Peptides of the invention can also be synthesized 
by various well known solid phase peptide synthesis methods, such as those 

25 described by Merrifield (J. Am. Chem. Soc, 815:2149, 1962), and Stewart and 
Young (Solid Phase Peptides Synthesis, Freeman, San Francisco, 1969, pp 27- 
62), using a copoly (styrene-divinylbenzene) containing 0.1-1.0 mMol amines/g 



WO 95/28169 



PCTAJS95/04655 



-38- 

polymer. On completion of chemical synthesis, the peptides can be deprotected 
and cleaved from the polymer by treatment with liquid HF-10% anisole for about 
1/4-1 hours at 0°C. After evaporation of the reagents, the peptides are extracted 
from the polymer with 1 % acetic acid solution which is then lyophilized to yield 
the crude material. This can normally be purified by such techniques as gel 
filtration on a "SEPHADEX G-15" or "SEPHAROSE" affinity column. 
Lyophilization of appropriate fractions of the column will yield the homogeneous 
peptide or peptide derivatives, which can then be characterized by such standard 
techniques as amino acid analysis, thin layer chromatography, high performance 
liquid chromatography, ultraviolet absorption spectroscopy, molar rotation, 
solubility, and quantitated by the solid phase Edman degradation. 

Compositions comprising CDK4I may include such substances as the stabilizers 
and excipients described below, predetermined amounts of proteins from the cell 
or organism that served as the source of the CDK4I gene, proteins from other 
15 than CDK4I source cells or organisms, and synthetic polypeptides such as poly- 
L-lysine. Recombinant CDK4I which is expressed in allogeneic hosts will of 
course be expressed completely free of gene source proteins. For example, 
expression of human CDK4I in Chinese Hamster Ovary (CHO) cells or other 
nonhuman higher mammalian cells results in a composition where the receptor 
20 is free of contaminating agents and human proteins. 
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VI. CDK4I DNA SEQUENCES AND EXPRESSION PRODUCTS 

The invention also provides polynucleotides which encode CDK4I. As used 
herein, "polynucleotide" refers to a polymer of deoxyribonucleotides or 
ribonucleotides, both single-stranded (including sense and antisense strands) 
5 and double-stranded, in the form of a separate fragment or as a component of 
a larger construct. DNA encoding a peptide of the invention can be assembled 
from cDNA fragments or from oligonucleotides which provide a synthetic gene 
which is capable of being expressed in a recombinant transcriptional unit. 
Polynucleotide sequences of the invention include genomic DNA, RNA and 
10 cDNA sequences. A polynucleotide sequence can be deduced from the genetic 
code, however, the degeneracy of the code must be taken into account. 
Polynucleotides of the invention include sequences which are degenerate as a 
result of the genetic code. 

As described in further detail below, polynucleotide sequences encoding CDK4I 
can be expressed in either prokaryotes or eukaryotes. Hosts can include 
microbial yeast, insect and mammalian organisms. Methods of expressing DNA 
sequences having eukaryotic or viral sequences in prokaryotes are well known 
in the art. Biologically functional viral and plasmid DNA vectors capable of 
expression and replication in a host are known in the art. Such vectors (i.e., 
"recombinant expression vectors") are used to incorporate DNA sequences of the 
invention. These sequences may also be contained in "host cells", i.e., 
transformed cells such as CHO and COS cells (e.g., ATCC Accession No. CRL 
1651) for use in gene expression. 

DNA encoding CDK4I is obtained from sources other than humans by a) 
25 obtaining a cDNA library from mammalian tissue b) conducting hybridization 
analysis with labelled DNA encoding human growth hormone receptor and 
binding protein or fragments thereof (usually, greater than 100 bp) in order to 
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detect clones in the cDNA library containing homologous sequences, and c) 
analyzing the clones by restriction enzyme analysis and nucleic acid sequencing 
to identify full-length clones. If full length clones are not present in the library, 
then appropriate fragments may be recovered from the various clones and 
5 ligated at restriction sites common to the clones to assemble a full-length clone. 

DNA which encodes CDK4I is obtained by chemical synthesis, by screening 
reverse transcripts of mRNA from placental cells or cell line cultures, or by 
screening genomic libraries from any cell. Also included within the scope of the 
invention is nucleic acid which may not encode CDK4I but which nonetheless is 

1 0 capable of hybridizing with DNA encoding CDK4I under low stringency conditions 
(e.g. "primers" or "probes"). The probes and primers of the invention will 
generally be oligonucleotides; i.e., either a single stranded polydeoxynucleotide 
or two complementary polydeoxynucleotide strands which may be chemically 
synthesized. Such synthetic oligonucleotides have no 5" phosphate and thus will 

15 not ligate to another oligonucleotide without adding a phosphate with an ATP in 
the presence of a kinase. A synthetic oligonucleotide will ligate to a fragment 
that has not been dephosphorylated. Such oligonucleotides may be detectably 
labelled with a detectable substance such as a fluorescent group, a radioactive 
atom or a chemiluminescent group by known methods and used in conventional 

20 hybridization assays. Such assays are employed in in vitro diagnosis, such as 
detection of CDK4I DNA or mRNA in tissue samples. 

In general, prokaryotes are used for cloning of DNA sequences in constructing 
CDK4I expressing recombinant expression vectors. For example, E. coli K12 
strain 294 (ATCC Accession No. 31446) may be particularly useful. Prokaryotes 
25 also are used for expression. The aforementioned strain, as well as E. coli 
W3110 (ATTC Accession No. 27325), bacilli such as Bacillus subtilus, and other 
enterobacteriaceae such as Salmonella typhimurium or Serratia marcescans, 
and various pseudomonas species may also be used for expression. 
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ln general, plasmid vectors which may be used in the invention contain 
promoters and control sequences which are derived from species compatible 
with the host cell. The vector ordinarily carries a replication site as well as 
marker sequences which are capable of providing phenotypic selection in 
5 transformed cells. For example, E. coli is typically transformed using pBR322, 
a plasmid derived from an E. coli species (Bolivar, et al., Gene, 2:95, 1977). 
pBR322 contains genes for ampicillin and tetracycline resistance and thus 
provides easy means for identifying transformed cells. The pBR322 plasmid, or 
other microbial plasmid must also contain or be modified to contain promoters 
10 and other control elements commonly used in recombinant DNA construction. 

Promoters suitable for use with prokaryotic hosts illustratively include the p- 
lactamaseand lactose promoter systems (Chang, etai., Nature, 275:615, 1978; 
and Goeddel, et al., Nature, 281:544, 1979), alkaline phosphatase, the 
tryptophan (trp) promoter system (Goeddel, Nucleic Acids Res., 8:4057, 1980) 

15 and hybrid promoters such as the taq promoter (de Boer, etai, Proc. Natl. Acad. 
Sci. USA, 80:21-25, 1983). However, other functional bacterial promoters are 
suitable. Their nucleotide sequences are generally known in the art, thereby 
enabling a skilled worker to ligate them to DNA encoding CDK4I (Siebenlist, et 
al, Cell, 20:269, 1 980) using linkers or adapters to supply any required restriction 

20 sites. 

In addition to prokaryotes, eukaryotic microbes such as yeast cultures may also 
be used. Sacchammyces cerevisiae, or common baker's yeast is the most 
commonly used eukaryotic microorganism, although a number of pther strains 
are commonly available. 

25 Suitable promoting sequences for use with yeast hosts include the promoters for 
3-phosphoglycerate kinase (Hitzeman, etai, J. Biol. Chem., 255:2073, 1980) or 
other glycolytic enzymes (Hess, et al. J. Adv. Enzyme Reg. 7:149, 1968; and 
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Holland, Biochemistry, 17:4900, 1978) such as enolase, glyceraldehyde-3- 
phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, 
phosphofructokinase, glucose-6-phosphate isomerase, 3-phosphoglycerate 
mutase, pyruvate kinase, triosephosphate isomerase, phosphoglucose 
5 isomerase, and giucokinase. 

Other yeast promoters, which are inducible promoters having the additional 
advantage of transcription controlled by growth conditions, are the promoter 
regions for alcohol dehydrogenase 2, isocytochrome C, acid phosphatase, 
degraded enzymes associated with nitrogen metabolism, metallothionine, 
10 glyceraldehyde-3-phosphate dehydrogenase, and enzymes responsible for 
maltose and galactose utilization. Yeast enhancers also are advantageously 
used with yeast promoters. 

"Control region" refers to specific sequences at the 5* and 3'ends of eukaryotic 
genes which may be involved in the control of either transcription or translation. 

15 Virtually all eukaryotic genes have an AT-rich region located approximately 25 
to 30 bases upstream from the site where transcription is initiated. Another 
sequence found 70 to 80 bases upstream from the start of transcription of many 
genes is a CCAAT region where X may be any nucleotide. At the 3'end of most 
eukaryotic genes is an AATAAA sequence which may be the signal for additional 

20 of the poly A tail to the 3*end of the transcribed mRNA. 

Preferred promoters controlling transcription from vectors in mammalian host 
cells may be obtained from various sources, for example, the genomes of viruses 
such as polyoma, Simian Virus 40 (SV40), adenovirus, retroviruses, hepatitis-B 
virus and most preferably cytomegalovirus, or from heterologous mammalian 
25 promoters, e.g. beta actin promoter. The early and later promoters of the SV40 
virus are conveniently obtained as an SV40 restriction fragment which also 
contains the SV40 viral origin of replication (Fiers, et a/, Nature, 273:1 1 3, 1 978). 
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The immediate early promoter of the human cytomegalovirus is conveniently 
obtained as a Hindlll E restriction fragment (Greenaway, et al., Gene t 18:355- 
360, 1982). Promoters from the host cell or related species also are useful 
herein. 

5 Transcription of a DNA encoding CDK4I by higher eukaryotes is increased by 
inserting an enhancer sequence into the vector. Enhancers are cis-acting 
elements of DNA, usually about from 10-300 bp, that act on a promoter to 
increase its transcription. Enhancers are relatively orientation and position 
independent having been found 5 1 (Laimins, et al., Proc.Natl.Sci.Acad.USA, 

10 78:993, 1981) and 3' (Lusky, et al., Mol. Cell Bio., 3:1108, 1983) to the 
transcription unit, and within an intron (Banerji, et al, Cell, 33:729, 1983) as well 
as within the coding sequence itself (Osborne, et al., Mol.Cell Bio., 4:1293 1984). 
Many enhancer sequences are now known from mammalian gene (globin, 
elastase, albumin, a-feto-protein and insulin). Typically, however, an enhancer 

15 from a eukaryotic cell virus will be used. Examples include the SV40 enhancer 
on the late side of the replication origin (bp 100-270), the cytomegalovirus early 
promoter enhancer, the polyoma enhancer on the late side of the replication 
origin, and adenovirus enhancers. 

Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, 
20 animal, human or nucleated cells from other multicellular organisms) will also 
contain sequences necessary for the termination of transcription which may 
affect mRNA expression. Expression vectors may also contain a selection gene, 
also termed a selectable marker. Examples of suitable selectable markers for 
mammalian cells which are known in the art include dihydrofolate reductase 
25 (DHFR), thymidine kinase or neomycin. When such selectable markers are 
successfully transferred into a mammalian host cell, the transformed mammalian 
host cell can survive if placed under selective pressure, (i.e., by being conferred 
with drug resistance or genes altering the nutrient requirements of the host cell). 
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Suitable host cells for transformation with and expression of the vectors of this 
invention encoding CDK4I in higher eukaryotes include: monkey kidney CV1 line 
transformed by SV40 (ATCC CRL 1651); human embryonic kidney line (Graham, 
F.L., et a/., J. Gen ViroL, 36:59, 1977); baby hamster kidney cells (ATCC CCL 
5 10); Chinese hamster ovary-cells-DHFR (Urlaub and Chasin, Proc. Nat'l Sci. 
Acad. USA, 77:4216, 1980); mouse Sertoli cells (Mather, J. P., BioLReprod., 
23:243-251, 1980); monkey kidney cells (ATCC CCL 70); african green monkey 
kidney cells (ATCC CRL-1587); human cervical carcinoma cells (ATCC CCL 2); 
canine kidney cells (ATCC CCL 34); buffalo rat liver cells (ATCC CRL 1442); 
10 human lung cells (ATCC CCL 75); human liver cells (HB 8065); mouse mammary 
tumor (ATCC CCL51); and TRI cells (Mather, et a/., Annals N.Y. Acad. Sci., 
383:44-68, 1982). 

'Transformation" means introducing DNA into an organism so that the DNA is 
replicable, either as an extrachromosomal element or by chromosomal 
integration, such as described in Graham, et aA, Virology, 52:456-457, 1973. 
However, other methods for introducing DNA into cells such as by nuclear 
injection or by protoplast fusion may also be used. If prokaryotic cells or cells 
which contain substantial cell wall constructions are used, transfection may be 
achieved by means well known in the art such as calcium treatment using 
calcium chloride as described by Cohen, F.N., et al., (Proc. Nat'l Acad. Sci. USA, 
69:2110, 1972). A particularly convenient method of transforming host cells is 
by lipofection using, for example, the liposomal product or DOTMA (a 
trademarked product of Bethesda Research Labs, Gaithersberg, MD). 

'Transfection" refers to the taking up of an expression vector by a host cell 
25 whether or not any coding sequences are in fact expressed. Numerous methods 
of transfection are known to the ordinarily skilled artisan using, for example, 
CaPQ 4 or electroporation. Successful transfection is generally recognized when 
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any indication of the operation of the transfected vector occurs within the host 
cell. 

Construction of suitable vectors containing the desired coding and control 
sequences employ standard ligation techniques. Isolated plasmids or DNA 
5 fragments are cleaved, tailored, and relegated in the form desired to form the 
plasmids required. 

For example, for analysis to confirm correct sequences in plasmids constructed, 
the ligation mixtures may be used to transform a host cell and successful 
transformants selected by ampicillin or tetracycline resistance where appropriate. 

10 Plasmids from the transformants are prepared, analyzed by restriction and/or 
sequenced by, for example, the method of Messing, et al., (Nucleic Acids f?es., 
9:309, 1981), the method of Maxam, et a/., (Methods in Enzymology, 65:499, 
1980), or other suitable methods which will be known to those skilled in the art. 
Size separation of cleaved fragments is performed using conventional gel 

1 5 electrophoresis as described, for example, by Maniatis, et at., (Molecular Cloning, 
pp. 133-134, 1982). 

Host cells may be transformed with the expression vectors of this invention and 
cultured in conventional nutrient media modified as is appropriate for inducing 
promoters, selecting transformants or amplifying genes. The culture conditions, 
20 such as temperature, pH and the like, are those previously used with the host cell 
selected for expression, and will be apparent to the ordinarily skilled artisan. 

With reference to SEQ ID NO's: 1-2, production of polynucleotides by the 
aforementioned techniques is well within the skill of one of ordinary skill in the art. 
The invention therefore encompasses CDK4I polynucleotides obtained by such 
25 techniques. 
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VII. CDK4I ANTIBODIES 

The invention also encompasses polyclonal and monoclonal antibodies which 
specifically bind to CDK4I. Such antibodies can be biologically produced through 
immunization of a mammal with CDK4I (including antigenic fragments thereof 
5 and fusion proteins), hereafter "immunogenic CDK4I". 

A multiple injection immunization protocol is preferred for use in immunizing 
animals with immunogenic CDK4I (see , e.g., Langone, et a/., eds. f "Production 
of Antisera with Small Doses of Immunogen: Multiple Intradermal Injections", 
Methods of Enzymology, Acad. Press, 1981). For example, a good antibody 
10 response can be obtained in rabbits by intradermal injection of 1 mg of 
immunogenic CDK4I emulsified in Complete Freund's Adjuvant followed several 
weeks later by one or more boosts of the same antigen in incomplete Freund's 
Adjuvant. 

If desired, immunogenic CDK4I molecules may be coupled to a carrier protein by 
15 conjugation using techniques which are well-known in the art. Such commonly 
used carriers which are chemically coupled to the molecules include keyhole 
limpet hemocyanin (KLH), thyroglobulin, bovine serum albumin (BSA), and 
tetanus toxoid. The coupled molecule is then used to immunize the animal (e.g., 
a mouse or a rabbit). 

20 Polyclonal antibodies produced by the immunized animals can be further purified, 
for example, by binding to and elution from a matrix to which the peptide to which 
the antibodies were raised is bound. Those of skill in the art will know of various 
techniques common in the immunology arts for purification and/or concentration 
of polyclonal antibodies, as well as monoclonal antibodies ( see , for example, 

25 Coligan, et a/., Current Protocols in Immunology, Unit 9, (Wiley Interscience, 
1991)). 
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For their specificity and ease of production monoclonal antibodies will be 
preferred for use in detecting CDK4I in analyte samples (e.g., tissue samples and 
cell lines). For preparation of monoclonal antibodies, immunization of a mouse 
or rat is preferred. The term "antibody" as used in this invention is meant also to 
5 include intact molecules as well as fragments thereof, such as for example, Fab 
and F(ab*) 2 , which are capable of binding the epitopic determinant. Also, in this 
context, the term "mAb's of the invention" refers to monoclonal antibodies with 
specificity for CDK4I. 

The general method used for production of hybridomas secreting monoclonal 
10 antibodies ("mAb's") is well known (Kohler and Milstein, Nature, 256:495, 1975). 
Briefly, as described by Kohler and Milstein, the technique comprised isolation 
of lymphocytes from regional draining lymph nodes of five separate cancer 
patients with either melanoma, teratocarcinoma or cancer of the cervix, glioma 
or lung. The lymphocytes were obtained from surgical specimens, pooled, and 
1 5 then fused with SHFP-1 . Hybridomas were screened for production of antibody 
which bound to cancer cell lines. An equivalent technique can be used to 
produce and identify mAb's with specificity for CDK4I. 

Confirmation of CDK4I specificity among mAbs of the invention can be 
accomplished using relatively routine screening techniques (such as the enzyme- 
20 linked immunosorbent assay, or "ELISA") to determine the elementary reaction 
pattern of the mAb of interest. 

It is also possible to evaluate an mAb to determine whether is has the same 
specificity as mAb of the invention without undue experimentation by determining 
whether the mAb being tested prevents a mAb of the invention from binding to 
25 CDK4I. If the mAb being tested competes with the mAb of the invention, as 
shown by a decrease in binding by the mAb of the invention, then it is likely that 
the two monoclonal antibodies bind to the same or a closely related epitope. 
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Still another way to determine whether a mAb has the specificity of a mAb of the 
invention is to pre-incubate the mAb of the invention with an antigen with which 
it is normally reactive, and determine if the mAb being tested is inhibited in its 
ability to bind the antigen. If the mAb being tested is inhibited then, in all 
5 likelihood, it has the same, or a closely related, epitopic specificity as the mAb 
of the invention. As noted further below, this same general technique may also 
be used to screen potential CDK4I ligand. 

Methods known in the art also allow antibodies which will specifically bind a 
preselected ligand to be identified and isolated from antibody expression 

10 libraries. For example, a method for the identification and isolation of an 
antibody binding domain which exhibits binding with a peptide of the invention is 
the bacteriophage y vector system. This vector system has been used to 
express a combinatorial library of Fab fragments from the mouse antibody 
repertoire in Escherichia colt (Huse, etaL, Science, 246 :1275-1281. 1989) and 

15 from the human antibody repertoire Mullinax, et a/., (Proc.NaVIAcad.Sci. USA, 
87:8095-8099, 1990). As described therein, antibodies which bound a preselect- 
ed ligand were identified and isolated from these antibody expression libraries. 
This methodology can also be applied to hybridoma cell lines expressing 
monoclonal antibodies which bind for a preselected ligand. 
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This invention further provides chimeric antibodies of the CDK4l-specific 
antibodies described above or biologically active fragments thereof. As used 
herein, the term "chimeric antibody" refers to an antibody in which the variable 
regions of antibodies derived from one species are combined with the constant 
5 regions of antibodies derived from a different species or alternatively refers to 
CDR grafted antibodies. Chimeric antibodies are constructed by recombinant 
DNA technology and are described, for example, in Shaw, et a/., J. Immun., 
138:4534, 1987, and Sun, LK., et a/., Proc.Natl.Acad.ScL USA, 84:214-218, 
1987. 

10 In addition, methods of producing chimeric antibody molecules with various 
combinations of "humanized" antibodies are known in the art and include 
combining murine variable regions with human constant regions (Cabily, et al., 
Proc.Natl.Acad.Sci USA, 81:3273, 1984), or by grafting the murine-antibody 
complementary determining regions (CDRs) onto the human framework 

15 (Riechmann, et at., Nature, 322:323, 1988). 

Any of the above described antibodies or biologically active antibody fragments 
can be used to generate CDR grafted and chimeric antibodies. "CDR" or 
"complementarity determining region" or "hypervariable region" are each defined 
as the amino acid sequences on the light and heavy chains of an antibody which 
20 form the three-dimensional loop structure that contributes to the formation of the 
antigen binding site. 

As used herein, the term "CDR grafted" antibody refers to an antibody having an 
amino acid sequence in which at least parts of one or more CDR sequences in 
the light and/or variable domain have been replaced by analogous parts of CDR 
25 sequences from an antibody having a different binding specificity for a given 
antigen or receptor. 
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The terms "light chain variable region" and "heavy chain variable region" refer to 
the regions or domains at the N-terminal portion of the light and heavy chains 
respectively which have a varied primary amino acid sequence for each antibody. 
The variable region of the antibody consists of the amino terminal domain of the 
5 light and heavy chains as they fold together to form a three-dimensional binding 
site for an antibody. 

The analogous CDR sequences are said to be "grafted" onto the substrate or 
recipient antibody. The "donor" antibody is the antibody providing the CDR 
sequence, and the antibody receiving the substituted sequences is the 
10 "substrate" antibody. One of skill in the art can readily produce these CDR 
grafted antibodies using the teachings provided herein in combination with 
methods well known in the art (see . Borrebaeck, Antibody Engineering: A 
Practical Guide (W.H. Freeman and Company, New York, 1992)). 

Under certain circumstances, monoclonal antibodies of one isotype might be 
1 5 more preferable than those of another in terms of their diagnostic or therapeutic 
efficacy. For example, from studies on antibody-mediated cytolysis it is known 
that unmodified mouse monoclonal antibodies of isotype gamma-2a and gamma- 
3 are generally more effective in lysing target cells than are antibodies of the 
gamma-1 isotype. This differential efficacy is thought to be due to the ability of 
20 the gamma-2a and gamma-3 isotypes to more actively participate in the cytolytic 
destruction of the target cells. Particular isotypes of a monoclonal antibody of 
different isotype, by using the sib selection technique to isolate class-switch 
variants (Steplewski, et al., Proc. NaVIAcad, ScL USA, 82:8653, 1985; Spira, et 
at., J. Immunol. Methods, 74:307, 1984). 
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The invention also encompasses cell lines which produce monoclonal antibodies 1 
of the invention. The isolation of cell lines producing monoclonal antibodies of 
the invention can be accomplished using routine screening techniques which 
permit determination of the elementary reaction pattern of the monoclonal 
5 antibody of interest. Thus, if a monoclonal antibody being tested binds and 
neutralizes the activity associated with the specific peptide, for example binds 
CDK4I and blocks CDK4l-mediated biological activity, then the monoclonal 
antibody being tested and the monoclonal antibody produced by the cell lines of 
the invention are equivalent. 

10 By using the monoclonal antibodies of the invention, it is possible to produce 
anti-idiotypic antibodies which can be used to screen monoclonal antibodies to 
identify whether the antibody has the same binding specificity as a monoclonal 
antibody of the invention. These antibodies can also be used for immunization 
purposes (Herlyn, etaL, Science, 232:100, 1986). Such anti-idiotypic antibodies 

15 can be produced using well-known hybridoma techniques (Kohler and Milstein, 
Nature, 256:495, 1975). 

An anti-idiotypic antibody is an antibody which recognizes unique determinants 
present on the monoclonal antibody produced by the cell line of interest. These 
determinants are located in the hypervariable region of the antibody. It is this 

20 region (paratope) which binds to a given epitope and, thus, is responsible for the 
specificity of the antibody. An anti-idiotypic antibody can be prepared by 
immunizing an animal with the monoclonal antibody of interest. The immunized 
animal will recognize and respond to the idiotypic determinants of the immunizing 
antibody and produce an antibody to these idiotypic determinants. By using the 

25 anti-idiotypic antibodies of the immunized animal, which are specific for a mono- 
clonal antibody of the invention produced by a cell line which was used to 
immunize the second animal, it is now possible to identify other clones with the 
same idiotype as the antibody of the hybridoma used for immunization. Idiotypic 
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identity between monoclonal antibodies of two cell lines demonstrates that the 
two monoclonal antibodies are the same with respect to their recognition of the 
same epitopic determinant. Thus, by using anti-idiotypic antibodies, it is possible 
to identify other hybridomas expressing monoclonal antibodies having the same 
5 epitopic specificity. 

It is also possible to use the anti-idiotype technology to produce monoclonal 
antibodies which mimic an epitope. For example, an anti-idiotypic monoclonal 
antibody made to a first monoclonal antibody will have a binding domain in the 
hypervariable region which is the "image" of the epitope bound by the first 
10 monoclonal antibody. Thus, the anti-idiotypic monoclonal antibody can be used 
for immunization, since the anti-idiotype monoclonal antibody binding domain 
effectively acts as an antigen. 

VIII. IMMUNOLOGICAL USE OF Anti-CDK4I ANTIBODIES 

Once produced as described supra, anti-CDK4l antibodies may be used 
15 diagnostically (e.g., to detect CDK4I in a biological cell sample or monitor the 
level of its expression). Preferably, to detect the CDK4I protein in premalignant 
somatic cells, a suitable cell sample will be derived from skin biopsies, sputum 
specimens, or urinary specimens. Germline cells may be obtained from any 
convenient source, such as skin, blood, or hair follicles. 

20 CDK4I may be detected and/or bound using anti-CDK4l antibodies in either liquid 
or solid phase immunoassay formats (when bound to a carrier). Examples of 
well-known carriers for use in solid-phase assay formats include glass, 
polystyrene, polypropylene, polyethylene, dextran, nylon, amylases, natural and 
modified celluloses, polyacrylamides, agaroses and magnetite. The nature of the 

25 carrier can be either soluble or insoluble for purposes of the invention. Those 
skilled in the art will know of other suitable carriers for binding antibodies, or will 
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be able to ascertain such, using routine experimentation. Examples of types of 
immunoassays which can utilize monoclonal antibodies of the invention are 
competitive and non-competitive immunoassays in either a direct or indirect 
format. 

5 Specific examples of such immunoassays are the radioimmunoassay (RIA) and 
the sandwich (immunometric) assay. Binding CDK4I using the anti-CDK4l 
antibodies of the invention can be done utilizing immunoassays which are run in 
either the forward, reverse, or simultaneous modes, including immuno- 
histochemical assays on physiological samples. Those of skill in the art will 
10 know, or can readily discern other immunoassay formats without undue 
experimentation. 

The anti-CDK4l antibodies of the invention may also be detectably labelled. 
There are many different labels and methods of labeling known to those of 
ordinary skill in the art. Examples of the types of labels which can be used in the 

15 present invention include enzymes, radioisotopes, fluorescent compounds, 
colloidal metals, chemiluminescent compounds, and bio-luminescent com- 
pounds. Those of ordinary skill in the art will know of other suitable labels for 
binding to the anti-CDK4l antibodies of the invention, or will be able to ascertain 
such, using routine experimentation. Furthermore, the binding of these labels to 

20 the anti-CDK4l antibodies of the invention can be done using standard 
techniques common to those of ordinary skill in the art. Another labeling 
technique which may result in greater sensitivity consists of coupling the 
antibodies to low molecular weight haptens. These haptens can then be 
specifically detected by means of a second reaction. For example, it is common 

25 to use haptens for this purpose such as biotin, which reacts with avidin. 

The anti-CDK4l antibodies of the invention can also be used for in vivo diagnosis, 
such as to identify a site of infection or inflammation or to monitor a particular 
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therapy. In using the anti-CDK4l antibodies of the invention for the in vivo 
detection of antigen having a peptide of the invention, the detectably labeled 
monoclonal antibody is given in a dose which is diagnostically effective. The 
term "diagnostically effective" means that the amount of detectably labeled anti- 
5 CDK4I antibody is administered in sufficient quantity to enable detection of the 
site having cells which express CDK4I. 

The concentration of detectably labeled anti-CDK4l antibody which is adminis- 
tered should be sufficient such that the binding to a peptide of the invention is 
detectable compared to the background. Further, it is desirable that the 
10 detectably labeled antibody be rapidly cleared from the circulatory system in 
order to give the best target-to-background signal ratio. 

As a rule, the dosage of detectably labeled anti-CDK4l antibody for in vivo 
diagnosis will vary depending on such factors as age, sex, and extent of disease 
of the individual. The dosage of antibody can vary from about 0.01 mg/m 2 , to 
15 about 500 mg/m 2 , preferably 0.1 mg/m 2 to about 200 mg/m 2 , most preferably 
about 0.1 mg/m 2 to about 10 mg/m 2 . Such dosages may vary, for example, 
depending on whether multiple injections are given, tissue, and other factors 
known to those of skill in the art. 

For in vivo diagnostic imaging, the type of detection instrument available is a 
20 major factor in selecting a given radioisotope. The radioisotope chosen must 
have a type of decay which is detectable for a give type of instrument. Still 
another important factor in selecting a radioisotope for in vivo diagnosis is that 
the half-life of the radioisotope be long enough so that it is still detectable at the 
time of maximum uptake by the target, but short enough so that deleterious 
25 radiation with respect to the host is minimized. Ideally, a radioisotope used for 
in vivo imaging will lack a particle emission, but produce a large number of 
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photons in the 140-250 keV range, which may be readily detected by 
conventional gamma cameras. 

The anti-CDK4l antibodies of the invention can be used in vitro and in vivo to 
monitor the course of disease therapy. For example, the CDK4I protein and 
5 peptide fragments of the invention may be used dtagnostically in biological fluids 
and tissues to monitor the fate of anti-CDK4l antibodies used therapeutically as 
described below. 

IX. THERAPEUTIC USES OF CDK4I 

A. Administration of Pharmaceutical Compositions 

10 Because cancers related to deletion of, or polymorphisms in, the gene for CDK4I 
are causatively related to the loss of, or reduction in, the inhibitory activity of 
CDK4I, administration of a therapeutically effective amount of CDK4I will delay, 
if not also prevent, the progression or onset of such cancers. Also, because 
many CDK4I gene deletions and polymorphisms are present in cells which are 

15 also genetically deficient in the ability to produce MTAse, then combined 
therapeutic regimes directed to providing the patient with therapeutically effective 
amounts of both CDK4I and MTAse will also be of benefit in delaying, if not also 
preventing, the progression or onset of such cancers. 

These ends may be achieved through the direct administration of purified, 
20 synthetic or recombinant CDK4I and, where appropriate, MTAse. Alternatively, 
these ends may be achieved by gene therapy, particularly gene replacement 
therapy. 

Means for the production of purified, synthetic or recombinant CDK4I and/or 
MTAse will be known to, or can be readily ascertained, by one of ordinary skill in 
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the art in combination with the information concerning CDK4I and MTAse 
provided in this disclosure (i.e., at SEQ.ID.Nos 1-5 and 14; see also . Figure 2 (a- 
b) (showing the genomic nucleotide sequence for the CDK4I gene, with exons 
underlined; and, Figure 10, showing the genomic nucleotide sequence for the 
5 MTAse gene, with the exons underlined). 

CDK4I compositions are prepared for administration by mixing CDK4I having the 
desired degree of purity with physiologically acceptable carriers. Such carriers 
will be nontoxic to recipients at the dosages and concentrations employed. 
Ordinarily, the preparation of such compositions entails combining the particular 

10 protein with buffers, antioxidants such as ascorbic acid, low molecular weight 
(less than about 10 residues) polypeptides, proteins, amino acids, carbohydrates 
including glucose or dextrins, chelating agents such as EDTA, glutathione and 
other stabilizers and excipients. Such compositions may also be lyophilized and 
will be pharmaceutically acceptable; i.e., suitably prepared and approved for use 

15 in the desired application. 

Given that CDK4I will be absent or of reduced efficacy in malignant or 
premalignant cells, cells having that condition will be the preferred targets for 
introduction of the CDK4I compositions of the invention. Where, however, the 
CDK4I abnormalities to be treated are in germline or somatic cells with no other 
20 detectable signs of malignancy, administration may be by any enteral or 
parenteral route in dosages which will be varied by the skilled clinician depending 
on the patient's presenting condition and the therapeutic ends to be achieved. 

In this regard, "biological activity" of CDK4 refers to the enzymatic reaction 
stemming from the binding of CDK4 to cyclin D and related molecules during the 
25 growth cycle of a human cell. Further, "biological activity" of CDK4I refers to the 
inhibition of the biological activity of CDK4 stemming from the binding of CDK4 
by CDK4I. 
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Generally, therefore, a "therapeutically effective dosage" of a CDK4I composition 
will be a dosage sufficient to inhibit the biological activity of CDK4 in human cells 
wherein CDK4I is absent or its biological activity is reduced (as a result, for 
example, of a polymorphism in the gene for CDK4I). To this end, the dosage of 
5 CDK4I can vary from about 0.1 mg/kg to about 300 mg/kg, preferably from about 
0.2 mg/kg to about 200 mg/kg, in one or more dose administrations daily, for one 
or several days. 

B. Gene Therapy 

The present invention identifies mutations in a target sequence of CDK4I that are 
1 0 unique to the primary tumor isolated from a subject and metastatic sites derived 
from the primary tumor. In the tumor cells, the mutated nucleotide sequence is 
expressed in an altered manner as compared to expression in a normal cell; 
therefore, it is possible to design appropriate therapeutic (as well as diagnostic) 
techniques directed to this specific sequence. Thus, where a cell-proliferative 
15 disorder is associated with the expression of a particular mutated tumor 
suppressor gene nucleic acid sequence, a nucleotide sequence that interferes 
with the specific expression of the mutated gene at the transcriptional or 
translations level can be used. This approach utilizes, for example, antisense 
oligonucleotides and/or ribozymes to block transcription or translation of a 
20 specific mutated mRNA, either by masking that mRNA with an antisense nucleic 
acid or by cleaving it with a ribozyme. 
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Antisense nucleic acids are DNA or RNA molecules that are complementary to 
at least a portion of a specific mRNA molecule (Weintraub, Scientific American, 
262:40, 1990). To date, several tumor suppressor genes and oncogenes have 
been targeted for suppression or down-regulation including, but not limited to, 
5 p53 (V. S. Prasolov et a/., Mol. Biol. (Moscow) 22:1105-1112, 1988); ras (S.K. 
Anderson et a/., Mol. /mmuno/. J26:985-991, 1989; D. Brown et al., Oncogene 
Res._4:243-249, 1989); fos (B. Levi et a/., Ceil. Differ. Dev. 25 (Suppl):95-102, 
1988; D. Mercola et al., Gene_Z2:253-265, 1988); and myc (S. O. Freytag, Mol. 
Cell. Biol. 8:1614-1624, 1988; E. V. Prochownik et a/., Mol. Cell. B/o/._8:3683- 
10 3695, 1988; S. L. Loke et al., Curr. Top. Microbiol. Immunol. 141:282-288, 1988). 

It is not sufficient in all cases to block production of the target mutant gene. As 
described in A. J. Levine, et al., (Biochimica et Biophisica Acta., 1032 :119-136. 
1990), there are at least five types of mutations that can contribute to the tumor 
phenotype. Briefly, Type I mutations are those mutations in genes that result in 

15 abnormal protein products, which act in a positive dominant fashion. Examples 
of such mutations are those in H-/as and K-ras genes that result in amino acid 
changes at positions 12 or 61 in the protein, leading to a protein that binds GTP 
and is constantly signaling for cell growth. Type II mutations are those that result 
in overproduction of an oncoprotein, such as the bcr-abl translocation that results 

20 in overproduction of a normal myc protein and an altered abl protein. Type III 
mutations are loss of function mutations wherein tumors arise as the result of 
loss of both alleles, such as with the retinoblastoma sensitivity gene (Rb) on 
human chromosome 13q14 and the WilnrTs tumor sensitivity gene localized at 
1 1q13. In 75% of colorectal carcinomas, one allele at the p12-p13.3 locus of 

25 chromosome 17 containing the p53 gene is commonly deleted, and in some 
cases the other p53 allele which remains in the colorectal cancer cells has been 
shown to produce a mutant p53 protein that presumably contributes to 
tumorigenesis. Type IV mutations are those that result in expression of a protein 
that does not directly contribute to the growth of cells, but enhances the ability 
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of cancer cells to survive. For instance, mutations to the v-ert>-A gene results in 
erythoblasts transformed with the altered gene being kept in the replication cycle. 
Type V mutations result from addition of new genetic information into tumor cells, 
commonly by way of a virus. In some cases the virus integrates its DNA into the 
5 cellular genome to produce proteins that bind to cellular negative regulators of 
growth, such as RB and p53, and thus, in effect, mimic the Type III loss of 
function mutation mechanism. 

Antisense therapy can be used to block production of mutant proteins that act 
directly to increase the probability of producing neoplastic cells, such as in 
mechanism Type III, Type IV and Type V mutations that mimic Type III. 
Antisense is also therapeutically effective when mutation is not dominant, for 
instance when a non-mutant allele remains that encodes the proper protein. 
However, when the mutation is dominant, as in Type I mutations, and in cases 
wherein either both alleles are deleted or one is deleted and the other is mutant, 
as in certain Type III mutations, antisense therapy is preferably accompanied by 
replacement therapy. In replacement therapy a wild type gene is introduced into 
the target cells identified as having a mutant tumor suppressor gene or 
protooncogene which results in production of the wild type protein necessary to 
forestall development of the neoplasia associated with the identified mutant 
gene(s). 

In the case of tumor suppressor genes, it is known that introducing a suppressor 
gene into cultured cells either causes cell death or causes no discernible 
changes, however, the cells may no longer be tumorigenic in animals. Thus, in 
cases where ribozyme and/or antisense therapy is accompanied by gene 
25 replacement therapy, the chances are increased that the cell population 
containing the mutant gene for which the ribozyme or antisense oligonucleotide 
is specific will no longer contribute to development of neoplasia in the subject 
being treated. 
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Synthetic antisense oligonucleotides are generally between 15 and 25 bases in 
length. Assuming random organization of the human genome, statistics suggest 
that a 17-mer defines a unique sequence in the cellular mRNA in human DNA; 
a 15-mer defines a unique sequence in the cellular mRNA component. Thus, 
5 substantial specificity for a selected genetic target is easily obtained using the 
synthetic oligomers of this invention. 

In the cell, the antisense nucleic acids hybridize to the corresponding mRNA, 
forming a double-stranded molecule. The antisense nucleic acids, interfere with 
the translation of the mRNA, since the cell will not translate a mRNA that is 

10 double-stranded. Antisense oligomers of about 15 nucleotides are preferred, 
since they are easily synthesized and are less likely to cause problems than 
larger molecules when introduced into the target nucleotide mutant producing 
cell. The use of antisense methods to inhibit the in vitro translation of genes is 
well known in the art (Marcus-Sakura, Anal.Biochem., 172:289, 1988). Less 

15 commonly, antisense molecules which bind directly to the DNA may be used. 

Ribozymes are RNA molecules possessing the ability to specifically cleave other 
single-stranded RNA in a manner analogous to DNA restriction endonucleases. 
Through the modification of nucleotide sequences that encode these RNAs, it is 
possible to engineer molecules that recognize specific nucleotide sequences 
20 associated with production of a mutated proto oncogene or tumor suppressor 
gene in an RNA molecule and cleave it (Cech, J.Amer.Med. Assn., 260:3030, 
1988). A major advantage of this approach is that, because they are sequence- 
specific, only target mRNAs with particular mutant sequences are inactivated. 

There are two basic types of ribozymes, namely, tetrahymena-type (Hasselhoff, 
25 Nature, 334:585, 1988) and M hammerhead"-type. Tetrahymena-type ribozymes 
recognize sequences which are four bases in length, while "hammerhead'-type 
ribozymes recognize base sequences 11-18 bases in length. The longer the 
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recognition sequence, the greater the likelihood that the sequence will occur 
exclusively in the target mRNA species. Consequently, hammerhead-type 
ribozymes are preferable to tetrahymena-type ribozymes for inactivating a 
specific mRNA species, and 18-based recognition sequences are preferable to 
5 shorter recognition sequences. 

Unmodified oligodeoxyribonucleotides are readily degraded by serum and 
cellular nucleases. Therefore, as is well known in the art, certain modifications 
of the phosphate backbone have conferred nuclease resistance to antisense 
DNA. For instance phosphorothioate, methylphosphonate, and a-anomeric 
sugar-phosphate, backbone-modified oligomers have increased resistance to 
serum and cellular nucleases. In addition, methylphosphonates are nonionic and 
offer increased lipophilicity to improve uptake through cellular membranes. The 
use of modified oligonucleotides as antisense agents may require slightly longer 
or shorter sequences because chemical changes in molecular structure can 
affect hybridization (L A. Chrisey et aL, BioPharm 4:36-42, 1991). These 
backbone-modified oligos bind to a target sequence and exert their inhibitory 
effects by blocking the binding of the cell's translational machinery to a specific 
RNA or by inducing ribonuclease H activity through the formation of RNA/DNA 
duplex structures. 

20 The present invention also provides gene therapy for the treatment of cancer 
conditions; i.e., cell proliferative disorders that are mediated by a deletion of, or 
polymorphism in, the CDK4I gene. Such therapy would achieve its effect by 
introduction of the specific antisense polynucleotide and/or replacement wild type 
gene into cells identified by the methods of this invention as having the 

25 proliferative disorder caused by mutated genes. Whether the cell will require 
replacement of the wild type gene encoding the CDK4I gene as well as antisense 
therapy to prevent replication of a CDK4I gene bearing a polymorphism must be 
determined on a case by case basis and will depend upon whether the mutation 
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has a dominant effect, ie., whether both alleles of the wild type gene have been 
destroyed so that total absence of the gene has a cell proliferative effect. 

Delivery of antisense tumor suppressor polynucleotides specific for mutated 
genes as well as of replacement wild type genes can be achieved using a 
5 recombinant expression vector such as a chimeric virus or a colloidal dispersion 
system. Preferred for therapeutic delivery of antisense sequences is the use of 
liposomes, especially targeted liposomes. 

Various viral vectors that can be utilized for gene therapy as taught herein 
include adenovirus, herpes virus, vaccinia, or, preferably, an RNA virus such as 

10 a retrovirus. Preferably, the retroviral vector is a derivative of a murine or avian 
retrovirus. Examples of retroviral vectors in which a single foreign gene can be 
inserted include, but are not limited to: Moloney murine leukemia virus 
(MoMuLV), Harvey murine sarcoma virus (HaMuSV), murine mammary tumor 
virus (MuMTV), and Rous Sarcoma Virus (RSV). A number of additional 

15 retroviral vectors can incorporate multiple genes. All of these vectors can 
transfer or incorporate a gene for a selectable marker so that transduced cells 
can be identified and generated. By inserting one or more sequences of interest 
into the viral vector, along with another gene which encodes the ligand for a 
receptor on a specific target cell, for example, the vector is now target specific. 

20 Retroviral vectors can be made target specific by inserting, for example, a 
polynucleotide encoding a sugar, a glycolipid, or a protein. Preferred targeting 
is accomplished by using an antibody to target the retroviral vector. Those of skill 
in the art will know of, or can readily ascertain without undue experimentation, 
specific polynucleotide sequences which can be inserted into the retroviral 

25 genome to allow target specific delivery of the retroviral vector containing the 
polynucleotides of interest. A separate vector can be utilized for targeted 
delivery of a replacement gene to the cell(s), if needed, or the antisense 
oligonucleotide and the replacement gene can optionally be delivered via the 
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same vector since the antisense oligonucleotide is specific only for target gene 
containing a polymorphism. 

Since recombinant retroviruses are defective, they require assistance in order to 
produce infectious vector particles. This assistance can be provided, for 
5 example, by using helper cell lines that contain plasmids encoding all of the 
structural genes of the retrovirus under the control of regulatory sequences 
within the LTR. These plasmids are missing a nucleotide sequence that enables 
the packaging mechanism to recognize an RNA transcript for encapsidation. 
Helper cell lines that have deletions of the packaging signal include, but are not 
10 limited to, UJ2, PA317 and PA12, for example. These cell lines produce empty 
virions, since no genome is packaged. If a retroviral vector is introduced into 
such helper cells in which the packaging signal is intact, but the structural genes 
are replaced by other genes of interest, the vector can be packaged and vector 
virion can be produced. 

15 Another targeted delivery system for antisense polynucleotides is a colloidal 
dispersion system. Colloidal dispersion systems include macromolecule 
complexes, nanocapsules, microspheres, beads, and lipid-based systems 
including oil-in-water emulsions, micelles, mixed micelles, and liposomes. The 
preferred colloidal system of this invention is a liposome. Liposomes are artificial 

20 membrane vesicles which are useful as delivery vehicles in vitro and in vivo. It 
has been shown that large unilamellar vesicles (LUV), which range in size from 
0.2-4.0 ptm can encapsulate a substantial percentage of an aqueous buffer 
containing large macromolecules. RNA, DNA and intact virions can be encapsu- 
lated within the aqueous interior and be delivered to cells in a biologically active 

25 form (Fraley, et a/., Trends Biochem. Sc/., 6:77, 1981). In addition to mammalian 
cells, liposomes have been used for delivery of polynucleotides in plant, yeast 
and bacterial cells. In order for a liposome to be an efficient gene transfer 
vehicle, the following characteristics should be present: (1) encapsulation of the 
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genes encoding the antisense polynucleotides at high efficiency while not 
compromising their biological activity; (2) preferential and substantial binding to 
a target cell in comparison to non-target cells; (3) delivery of the aqueous 
contents of the vesicle to the target cell cytoplasm at high efficiency; and (4) 
5 accurate and effective expression of genetic information (Mannino, et ai t 
Biotechniques, 6:682, 1988). 

The composition of the liposome is usually a combination of phospholipids, 
particularly high-phase-transition-temperature phospholipids, usually in 
combination with steroids, especially cholesterol. Other phospholipids or other 
10 lipids may also be used. The physical characteristics of liposomes depend on 
pH, ionic strength, and the presence of divalent cations. 

Examples of lipids useful in liposome production include phosphatidyl 
compounds, such as phosphatidylglycerol, phosphatidylcholine, 
phosphatidylserine, phosphatidylethanolamine, sphingolipids, cerebrosides, and 
15 gangliosides. Particularly useful are diacylphosphatidylglycerols, where the lipid 
moiety contains from 14-18 carbon atoms, particularly from 16-18 carbon atoms, 
and is saturated. Illustrative phospholipids include egg phosphatidylcholine, 
dipalmitoylphosphatidylcholine and distearoylphosphatidylcholine. 
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The targeting of liposomes can be classified based on anatomical and 
mechanistic factors. Anatomical classification is based on the level of selectivity, 
for example, organ-specific, cell-specific, and organelle-specific. Mechanistic 
targeting can be distinguished based upon whether it is passive or active. 
5 Passive targeting utilizes the natural tendency of liposomes to distribute to cells 
of the reticulo-endothelial system (RES) in organs which contain sinusoidal 
capillaries. Active targeting, on the other hand, involves alteration of the 
liposome by coupling the liposome to a specific ligand such as a monoclonal 
antibody, sugar, giycolipid, or protein, or by changing the composition or size of 
10 the liposome in order to achieve targeting to organs and cell types other than the 
naturally occurring sites of localization. 

The surface of the targeted delivery system may be modified in a variety of ways. 
In the case of a liposomal targeted delivery system, lipid groups can be 
incorporated into the lipid bilayer of the liposome in order to maintain the 
15 targeting ligand in stable association with the liposomal bilayer Various linking 
groups can be used for joining the lipid chains to the targeting ligand. 

Other means for performing gene therapy are known in the art; to wit, Feigner, 
et a/., Science, 247:1465, 1990; Stibling, et a/., Proc.NatLScLAcad. USA, 
89:11277-11281, 1992; and, Tang, et a/., Nature, 356:152-154, 1992, (the 
20 disclosures of which are incorporated herein by this reference to illustrate 
knowledge in the art concerning methods for performing gene therapy). 
However, the preferred means for performing gene therapy of the invention is the 
administration of such genes in "naked", non-replicating form (i.e., without 
association with a viral vector, liposome, host cell or equivalent means for 
* 25 expression of nucleic acids). Further, the preferred routes for administration of 
such naked nucelotides is via injection into skeletal muscle or, most preferably, 
via introduction into tissue which contains a relatively high concentration of 
antigen presenting cells. 
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X. CDK4I KITS AND PRODUCTS 

For use in the diagnostic research and therapeutic applications suggested above, 
kits are also provided by the invention. In the diagnostic and research 
applications such kits may include any or all of the following: assay reagents, 
5 buffers, CDK4I protein and/or fragments, CDK4I recombinant expression vectors, 
CDK4I oligonucleotides and other hybridization probes and/or primers, and/or a 
suitable assay device. A therapeutic product may include sterile saline or 
another pharmaceutical^ acceptable emulsion and suspension base for use in 
reconstituting lyophilized CDK4I oranti-CDK4l suspensions, suitably labeled and 
10 approved containers of CDK4I or anti-CDK4l compositions, and kits containing 
these products for use in connection with the diagnostic kit components as 
described above. 

Such a kit may also comprise a carrier means being compartmentalized to 
receive in close confinement one or more container means such as vials, tubes, 
15 and the like, each of the container means comprising one of the separate 
elements to be used in the method. 

For example, one of the container means may comprise a hybridization probe 
that is or can be detectably labelled. A second container may comprise a cell 
lysis buffer. The kit may also have containers holding nucleotide(s) for 
20 amplification of the target nucleic acid sequence and/or a container comprising 
a reporter-means, such as a biotin-binding protein, such as avidin or streptavidin, 
bound to a reporter molecule, such as an enzymatic, fluorescent, or radionuclide 
label. 

The invention having been fully described, it is further illustrated by the example 
25 below. It will be understood, however, that the invention is not limited by the 
examples but is defined by the appended claims. 
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EX AMPLE 1 

IDENTIFICATION AND CHA RACTERIZATION OF THE CDK4I GENE 

MTAse cDNA (SEQ ID NO: 14) was isolated and used to probe a human 
placenta lambda phage library. A 2 kilobase Hind III fragment contained the 3- 
5 end of the MTAse gene by sequence analysis. Chromosome walking was 
performed, starting with the 3-end of MTAse. Several screening cycles of the 
known P1 phage (see, e.g., Pierce, etai, Meth. EnzymoL, 216:549-574, 1992) 
and subsequent lambda phage libraries led to the isolation of clones that 
encompassed the deleted region in T98G. Restriction fragments of these phage 
10 were subcloned, partially sequenced, and mapped by Southern blotting and 
poulsed field gel electrophoresis. Figure 4 shows the map of human 
chromosome 9p21 between the MTAP and interferon-3 (IFNB) gene loci, 
focusing on the deleted segment in the T98G glioma cell line. 

The polymerase chain reaction (PCR) was used to determine the frequency of 
15 deletion of several sequence tagged sites (STS) from chromosome 9p in 46 
different human malignant cell lines (Table 1). Depending on the cell type, either 
STS 54F, or STS 5Bs was deleted most frequently. These results focused 
attention on the 50 kilobase region between STS 54F and STS 5BS. 

Eight malignant cell lines with breakpoints between 54F and 5BS were then 
20 analyzed by STS-PCR, with new probes from the intervening region. The 
deletion maps are shown in Figure 5. A 19 kilobase lambda phage clone (10B1) 
identified the most frequently deleted site ( see . Figure 4 (a)). Phage DNA of 
clone 10B1 was digested with ECORI and subcloned into ECO-RI-cut pBLUE- 
SCRIPT II SK+ (Stratagene, La Jolla, CA). DNAs from human placenta and 
25 melanoma cell lines were digested with EcoRI, resolved on a 0.8% agarose gel, 
and transferred to nylon membranes. Subclones were subjected to automated 
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DNA sequencing. The 4.2Kb subclone 10B1-10 contained both the CDK4I and 
the CDK413' nucleotide sequences (SEQ ID NO's 1-2 and 4-5) while the CDK4I5' 
nucleotide sequence is contained in a 10A1 subclone. 

The sequence of the 10B1-10 subclone from clone 10B1 (Figure 4 (a)) contains 
5 a 306 base pair open reading frame. The 3-end of the coding region, and the 
3-noncoding region, are located 2.6 kilobases toward the MTAse gene while the 
5-end of the gene is telomeric to the deleted region in T98G. 
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The PCR amplification reactions were carried out in a total volume of 20 (A, 
containing 0.1 1*9 of DNA, 1x PCR buffer (10 mM) Tris-HC1, pH 8.3, 50 mM KC1, 
1.5 mM MgC1 2 , 0.01% gelatin), 200 of each dNTP, 20 ng each of sense and 
anti-sense primers, and 0.5 units of Taq DNA polymerase. Thirty-five cycles 
5 were performed (64 °C annealing and 72 °C extension) followed by gel 
electrophoresis. 

CDK415 (SEQ ID NO. 3) is a 139 bp product generated by reverse transcriptase 

- PCR in cell line H661 (ATCC Accession No. ) using a sense primer (5- 

AATTCGGCACGAGGCAGCAT-3') and an anti-sense primer (5- 

10 TTATTTGAGCTTTGGTTCTG-3'). PCR products were subcloned and 
sequenced. Clone p7-4 (ATCC Accession No. 55540) contained the 5' sequence 
of the CDK4 inhibitor cDNA. A 139 bp product was amplified from clone p7-4 
with a sense primer and a new anti-sense primer (5-TCGGCCTCC- 
GACCGTAACTA-3') and used for Southern blotting. Blots were hybridized at 

15 65°C overnight, washed at 65°C in 0.1 x SSC containing 0.1% SDS, and 
exposed to X-ray film. 

EXAMPLE II 

DELETION OR POLYMORPHISMS IN THE CDK4I GENE 

IN CANCER CELL LINES 

20 As shown in Figure 9, the 46 originally screened malignant cell lines (Table 1) 
were rescreened with STS-PCR primers, corresponding to the CDK41' and 
CDK413' exons (SEQ ID NO.'s 8-11). Sixty-one percent of melanomas, 87% of 
gliomas, 45% of non-small cell lung cancers, and 64% of leukemias have 
homozygous deletions of the CDK41 gene fragment (Table 1). 

25 Melanoma cell line WM266-4 has deleted only the 5'-end of the CDK4 inhibitor 
gene (SEQ ID NO. 3). It was positive for CDK4I', negative for STS 5BS, and 
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produced an abnormal 7.0 kiiobase band after EcoRI digestion, electrophoresis 
and hybridization to a probe from the 5-region of the CDK4 inhibitor gene. On 
the other hand, melanoma cell line SK-MEL-31 has deleted only the 3-end of the 
CDK4I gene (SEQ ID NO. 5). The Detroit 462 cell line (a pharyngeal carcinoma) 
5 has a 29 kiiobase deletion within the CDK4I gene. It was positive for CDK4I3', 
negative for CDK4I', but positive for STS-5BS and STS-71 F. The latter two STSs 
are located centromeric to the 5-end of the CDK4 inhibitor gene. 

Reverse transcriptase-polymerase chain reaction (RT-PCR) assays in human 
cells revealed CDK4 inhibitor gene transcript in normal cells, but not in cancers 
10 with established deletions of the CDK4 inhibitor gene (Figure 9). 

To perform the assays, mRNA was purified with a "FASTTRACK" Kit (Invitrogen, 
San Diego, CA) and was treated with RNase-free DNase I (Pharmacia) using 
human placenta DNA as a control to ensure complete DNase I digestion. After 
first-strand cDNA synthesis with a Stratascript RT-PCR Kit (Stratagene La Jolla, 
15 CA), cDNA was amplified with CDK4I3' primers (58°C annealing and 70°C 
extension). 

Primers for the control G3PDH gene (5-TGGTATGGTGGAAGGACTCATGAC-3* 
and S-ATGCCAGTGAGCTTCCCGTTCAGC-S') amplified a 190 bp product 
(55°C annealing and 72°C extension). RT-PCR's for the CDK4I3 1 exon and 
20 G3PDH were performed separately and resolved on a 2% agarose gel. The 355 
bp RT-PCR product seen in lanes 1, 2 and 4 of Figure 9 derived from cDNA. 
These results indicate that human cells contain a single CDK4 inhibitor gene, that 
is homozygously deleted or rearranged in the majority of melanomas, gliomas, 
and leukemias, and in many non-smail cell lung cancers. 

25 EXAMPLE ill 

DETECTION OF A DELETION OF THE CDK4I GENE 
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A. Preparation of Solid Support Materials for PCR-ELISA. 

Twenty §A of 2.5 pmol//J an aminated oligonucleotides specific for the CDK4I 
gene in 50mM 2-[N-morpholino] ehthanesulfonic acid and 1mM EDTA, pH 5.5 
5 were placed in each well of a 96 well microtiter plate made of polycarbonate 
(Costar. Cambridge, MA.). Then 20/J of 4 mg/ml 1-ethyl-3-(3-dimethylamino- 
propyl) carbodimide hydrochloride (EDC, Pierce Chemical) were added and the 
plate was incubated at 37°C for 2 hours. Wells were then washed once with 
phosphate buffered saline (PBS) and blocked with 1% bovine serum albumin 
10 (BSA)forone hour. 

B. Triple Primer PCR Amplification. 

Using the primers described in SEQ ID NO's: 8-13, genomic DNA obtained from 
the cell lines identified in Table 1, supra, was amplified as follows. O.Vg of 
genomic DNA was added to an amplification mixture consisting of 10mM Tris- 
15 HCI, pH 8.3, 50mM KCI, 1.5mM MgCI 2) and 0.01% gelatin (PCR buffer), as well 
as 200/iM of each dNTP, 20ng each of the primers, and 0.5 units of Taq DNA 
polymerase. Thirty cycles were performed in a Perkin-Elmer Cetus DNA thermal 
cycler, each cycle consisting of denaturation (94°C, 1 minute), annealing (50- 
55°C, 1 minute) and extension (72°C, 1 minute). 

20 C. Detection of Hybridization and Extension of Immobilized Primers 

The wells were washed three times with HW buffer (3xSSC, 0.1 %N- 
lauroylsarcosine) and once with blocking buffer (0.5% GENIUS blocking reagant 
(a trademarked product of Boehringer Mannheim), in 100mM Tris-HCI, pH 7.5, 
and 800mM NaCI), and incubated with 80^1 of tetramethylbenzidine and 
25 horseradish peroxidase (kikegaard & Perry Laboratories). The reaction was 
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stopped with 80/J of 1 M O-phosphate at the appropriate time point. 1 50 M I each 
was transferred to another microtiter plate and OD was measured at 450 nm with 
a microtiter reader from Molecular Devices, Menlo Park, CA. 
The results of this assay are summarized in Table 1 , supra. 

5 EXAMPLE V 

DETECTION OF A GERM LINE NONSENSE MUTATION 
IN DYSPLASTIC NEVUS SYNDROME CELLS 

Primers for CDK4I' (SEQ ID NO's: 8-9) were constructed and the reverse 
transcriptase polymerase chain reaction (RT-PCR) used to amplify a CDK4I gene 

10 transcript in a human lymphoblastoid cell line (GM06921; ATCC Acession 

No. ) derived from a human patient with dysplastic nevus syndrome (familial 

melanoma). Using the technique described by Orita, et a/., supra, and/or the 
technique described by Wu, et a/., supra, a mutated form of the CDK4I gene 
transcript was identified in the GM06921 cell line). Sequence analysis of the 

15 transcript revealed a C to T transition at position 166 of the mRNA, which results 
in a nonsense mutation ( see. Figure 6). 

EXAMPLE VI 

DETECTION OF A CDK415' GENE MICRODELETION 
IN A LEUKEMIA CELL LINE 

20 Primers for CDK4I5' (SEQ ID NO f s: 12-13) were constructed and the reverse 
transcriptase polymerase chain reaction (RT-PCR) used to amplify a CDK4I gene 
transcript in a human leukemia cell line, U937 (ATCC Accession No. CRL 1593). 
Using the technique described by Orita, et a/., supra, and/or the technique 
described by Wu, et ai, supra, a mutated form of the CDK4I5' gene transcript 

25 was identified in the U937 cell line and sequenced, showing a microdeletion of 
18 base pairs ( see . Figure 7). 
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SUMMARY OF SEQUENCES 

SEQ ID NO: 1 is the nucleotide sequence for the 5' region of human genomic 
CDK4I and the corresponding, predicted amino acid sequence for the 5' region 
of CDK4I. 

5 SEQ ID NO. 2 is the nucleotide sequence for the internal and 3' regions of 
human, genomic CDK4I. 

SEQ ID NO's: 3 through 5 are, respectively, the CDKI5', CDK4P, and CDK4I3' 
exons. 

SEQ ID NO's: 6 and 7 are sequences for oligonucleotide primers for the region 
10 between 54F and 5BS of the 9p21 chromosome (i.e., corresponding to clone 
10B1). 

SEQ ID NO's: 8 through 13 are sequences for oligonucleotide primers for the 
CDK4P, CDK4I3' and CDK4I5' exons, respectively. 

SEQ ID NO: 14 is the full-length genomic nucleotide sequence for MTAse. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANT: THE REGENTS OF THE UNIVERSITY OF CALIFORNIA 



(ii) TITLE OF INVENTION : METHODS FOR TREATMENT AND DETECTION OF 
SUSCEPTIBILITY TO, AND PROGRESSION OF, CANCER 

(iii) NUMBER OF SEQUENCES: 14 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: S PENS LEY HORN JUBAS & LUBITZ 

10 (B) STREET: 4225 Executive Square, Suite 1400 

(C) CITY: La Jolla 

(D) STATE: California 

(E) COUNTRY: US 

(F) ZIP: 92037 



15 (v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 



20 (vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 4/11/95 

(C) CLASSIFICATION: 
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( vi i i ) ATTORNEY /AGENT INFORMATION : 

(A) NAME: HOWELLS, STACY L . 

(B) REGISTRATION NUMBER: 34,842 

(C) REFERENCE /DOCKET NUMBER: FD-3459 PCT 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE ; 619/455-5100 

(B) TELEFAX: 619/455-5110 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 1146 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



15 (vii) IMMEDIATE SOURCE: 

(A) LIBRARY: 5 1 region of CDK4I (genomic) 



(ix) 



FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1146 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

TTTGGGGNNA AGTTTGGGAA AANCCAATCC TCCTTCCTTT CCAACNNTGC TTCTGGCGAG 
60 

GCTCCTTCCC GGCTTGTTCC CCCNGGGGGA AGACCCAACC TGGGCCGACC TTCAGGGTTC 
5 120 

CCACATTCCC TAANTGCTCG GAGTTAATAN CACCTCCTCC GAGNACTCGC TCACGNCGTC 
180 

CCCTTNCCTG GAAAGATACC GCGNTCCCTC NAGAGGATTT GAGGGACAGG GTCGGAGGGG 
240 

10 NCTCTTCCGC CAGCACCGGA GGAAGAAAGA GGAGGGGCTG GCTGGTCACC AGAGGGTGGG 

300 

GCGGACCGCG TGCGCTCGGC GTCTGCGGAG AGGGGGAGAG CAGGCAGCGG GCGG CGGGGA 
360 

GCAGCATGGA GCCGGCGGCG GGGAGCAGCA TGGAGCCTTC GGCTGACTGG CTGGC CACGG 
15 420 

CCGCGGCCCG GGGTCGGGTA GAGGAGGTGC GGGCGCTGCT GGAGGCGGGG GCGCTGCCCA 
480 

ACGCACCGAA TAGTTACGGT CGGAGGCCGA TCCAGGTGGG TAGAGGGTCT GCAGCGGGAG 



540 
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CAGGGGATGG CGGGCGACTC TGGAGGACGA AGTTTGCAGG GGAATTGGAA TCAGGTAGCG 
600 

CTTCGATTCT CCGGAAAAAG GGGAGGCTTC CTGGGGAGTT TTCAGAAGGG GTTTGTAATC 
660 

5 ACAGACCTCC TCCTGGCGAC GCCCTGGGGG CTTGGGAAGC CAAGGAAGAG GAATNAGGAG 

720 

CCACGCGCGT ACAGATCTCT CGAATGCTGA SAMGATYTTR AGGGS SGRAM ATATTTGTAT 
780 

TCAGATGGAA GTATKCTCTT TATCAGATAC AAAATTTACG AACGTTTGGG ATAAAAAGGG 
10 840 

AGTCTTAAAG AAATKTAAGA TGTKCTGGGA CTACTTAGCC TCCAATTCAC AGATACCTGG 
900 

ATGGAGCTTA TCTTTC TTAC TAGGAGGGAT TATCAGTGGA AATCTGTGGN GTATGTTGGA 
960 

1 5 ATAAATATCG AATATAAATT TTGATCGAAA TTATTCAGAA GCGGCCGGGC GCGGTGCCTC 

1020 

ACGCCTTGTA ATCCCTTCAC TTTGGGAGAT CAAGGCGGGG GGGAATCANC TGAGGTCGGG 
1080 

AGTTCGAGAA CAGCCTGGGC AACAGGTGAA AACCTCGCCT CCTACTAAAA AATACAAAAA 



20 



1140 
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GTAGNC 
1146 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) IiENGTH: 4286 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



10 (vii) IMMEDIATE SOURCE: 

(B) CLONE: internal and 3' region of CDK4I (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..4286 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

GAATTCATTG TGTACTGAAG AATGGATAGA GAACTCAAGA AGGAAATTGG AAACTGGAAG 
60 

CAAATGTAGG GGTAATTAGA CACCTGGGGC TTGTGTGGGG GTCTGCTTGG CGGTGAGGGG 
120 
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GCTCTACACA AGCTTCCTTT CCGTCATGCC GGCCCCGACC CTGGCTCTGA CCATTCTGTT 
180 

CTCTCTGGCA GGTCATGATG ATGGGCAGCG CCCGAGTGGC GGAGCTGCTG CTGCTCCACG 
240 

5 GCGCGGAGCC CAACTGTGCC GACCCCGNCA CTCTCACCCG ACCCGTGCAC GACGCTGCCC 

300 

GGGAGGGCTT CCTGGACACG CTGGTGGTGC TGCACCGGGC CGGGGCGCGG CTGGACGTGC 
360 

GCGATGCCTG GGGCCGTCTG CCCGTGGACC TGGCTGAGGA GCTGGGCCAT CGNGATGTCG 
10 420 

CACGGTACCT GCGCGCGGCT GCGGGGGGCA CCAGAGGCAG TAACCATNCC CGNATAGATG 
480 

CCGCGGAAGG TCCCTCAGGT GAGGACTGAT GATCTNAGAA TTTGNCCCCT GAGAGCTTCC 
540 

15 AAAGCTCAGA GNATTCATTT TCCAGCACAG AAAGTNCAGC CCGGGAGANC AGTCTCCGGT 

600 

CTTGTCTCAG CTCACGCGCC AATCGGTGGG ACGGCCTGAG TCTCCCTATC GCCCTGCCCC 
660 



20 



GCCAGGGCGG CAAATGGGAA ATAATCCCGA AATGGACTTG CGCACGTGAA AGCCCATTTT 



720 
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GTACATTATA CTTCCCAAAG CATACCACCA CCCAAACACC TACCCTCTGC TAGTTCAAGG 
780 

CCTAGACTGC GGAGCAATGA AGACTCAAGA GGCTAGAGGT CTAGTGCCCC CTCTTCCTCC 
840 

5 AAACTAGGGC CAGTTGCATC CACTTACCAG GTCTGTTTCC TCATTTGCAT ACCAAGCTGG 

900 

CTGGACCAAC CTCAGGATTT CCAAACCCAA TTGTGCGTGG CATCATCTGG AGATCTCTCG 
960 

ATCTCGGCTC TTCTGCACAA CTCAACTAAT CTGAACCTCC TCAGCTAATC TGACCCTCCG 
10 1020 

CTTNATGCGG TAGAGTTTAC CAGAGCTGCC CCAGGGGGTT CTGGGGACAT CAGGACCAAG 
1080 

ACTTCGCTGA CCCTGGCAGT CTGTGCACCG GAGTTGGCTC CTTTCCCTCT TAAACTTGTG 
1140 

15 CAAGAGATCG CTGAGAGATG AAGGTAGAAT TATGGTCCTC CTTGCCCTNG CCTTTCCTTT 

1200 

TAGTGATCTC AAAGCATCCT CCCTCCGTCC CCATTCCATG GCCCCAGTTC ACTACTCCCA 
1260 

CAGCTGTCTG GTGAAACTGA CAACATTACT CAATTGTTTC TGGGGGGAGG AACATTTTTT 
20 1320 
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TTTGAAACAA AATAGATATA TGAAACAGTA CACGGGAATT AACACGATTA TTTAAGGTAA 
1380 

AACATGACCT TGAAGATTAT GAAATCCATC TTATTTTGGC CCAGAACGGG GGCATTGGKC 
1440 

5 TCCTTGGCCC ATAGGGGAGC TGGGGAGGAC AGGGTGAAGA GTTAGCTCTA AGCCCTCTNN 

1500 

TTGGAGATGC TGTAAATACA GAACGCAAAA TCACCTTCGA AGTTAAAGAC GCGAAGTTCT 
1560 

TCTTTACTCG GCCCCTCCTC CCCTCCCCCC CGACAATTCC CTCCAGTTAC AGCTAGCATC 
10 1620 

CAGGTCCCGG GAGGTGAAGA AGGAGACTTC GGCTCCAGTT ACAGCTAGCA TCCGGGTCCC 
1680 

GATTTAGAAG GAGCTGCCAA TTACAGCGCG GTTCCAGGGC TGAGCAAAAA GCCTGAGGAG 
1740 

15 CCAAGTGGGA GAGGGAGTAA AACTACTGAA TTGGGCCACA AGCAAATGAA TAAACTGAAC 

1800 

GACTCTTAAC CAAAC CTAAT ATATTTAATC CAAACACACA AGTCTTTCAT TTCTTCCCTC 
1860 



20 



CTCCCTTCCT TCTCTTACTC CCCAACACCC CCTCTTCAAG CACAATTAAT TATATGG TTA 
1920 
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GATTCTACTG CGTGATCAGC CCTGTTCTAG GTGGTGGGCA CGCCAAGGTG AATGAGACCA 
1980 

AACAAGAGTC TTGCCCTCAT GGGGTTTACA TTTGGAGACA GAGTCGATCT GTTGCCCAAC 
2040 

5 CTGGAGTGCA GTGGCGCGAT CACAGCTCAC TGCAGCCTCA AACTCCCTGG CTCAAGGGGT 

2100 

TCTCCCACCT GAGCCTCCCG ACTAGCTGGG ACCACAGGTG CACGCCACGA CGCCTGGGTT 
2160 

TGTTTGTTTG TTTAATAGAG ACGAAGGTCT CACCATGTTA TCTGGGCTCA AGCGATCATC 
10 2220 

CCCCCTCCTC CTCCTAAAGT ACTGGGATTA CAGTCCCAAG CTATCTTGCC CGACCTGGGA 
2280 

AACAGACGTT AAGGAAGATA ACAATCTATT TTCAGAGAGC GAGTTTATAA AACCAATGCA 
2340 

15 ATGGGTAAAT ATGAAGTGTG AATAGGAGGA GAAGCTAAAG AGTGGTCGGA GAATCTAATG 

2400 

CAAGCTACGG GAGAAAGAAA CTCAAGTGCA AATGCTGCCT CAGGAATAAA CGTAAAAAGA 
2460 

GACTTTCAAG TGCAAATGCT CCCTCAGGAA TAAAATAATC TTGAGACTCT CAAGTGTAAA 
20 2520 
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TGCTGCCTCG GGAGAACCGA ACGGCGAGCT GGAGCCCATA CGCAACGAGA TTAGAGAGGA 
2580 

AGGCAGAAGC CAGAGCACAT GAATAAATGA GCATCCATTT TGTTTCAGAA ATGATCGGAA 
2640 

5 AC C ATTTGTG GGTTTGTAGA AGCAGGCATG CGTAGGGAAG CTACGGGATT CCGCCGAGGA 

2700 

GCGCCAGAGC CTGAGGCGCC CTTTGGTTAT CGCAAGCTGG CTGGCTCACT CCGCACCAGG 
2760 

TGCAAAAGAT GCCTGGGGAT GCGGGAAGGG AAAGGCCACA TCTTCACGCC TTCGCGCCTG 
10 2820 

GCATTGTGAG CAACCACTGA GACTCATTAT ATAACACTCG TTTTCTTCTT GCAACCCTGC 
2880 

GGGCCGCGCG GTCGCGCTTT CTCTGCCCTC CGCCGGGTGG ACCTGGAGCG CTTGAGCGGT 
2940 

15 CGGCGCGCCT GGAGCAGCCA GGCGGGCAGT GGACTAGCTG CTGGACCAGG GAGGTGTGGG 

3000 

AGAGCGGTGG CGGCGGGTAC ATGCACGTGA AGCCATTGCG AGAACTTTAT CCATAAGTAT 
3060 



TTCAATGCCG GTAGGGACGG CAAGAGAGGA GGGCGGGATG TTCCACACAT CTTTGACCTC 
20 3120 
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AGGTTTCTAA CGCCTGTTTT CTTTCTGCCC TCTGCAGACA TCCCCGATTG AAAGAACCAG 
3180 

AGAGGCTCTG AGAAACCTCC GGAAACTTAG ATCATCAGTC ACCGAAGGTC CTACAGGGCC 
3240 

5 ACAACTGCCC CCGCCACAAC CCACCCCGCT TTCGTAGTTT TCATTTAGAA AATAGAGCTT 

3300 

TTAAAAATGT CCTGCCTTTT AACGTAGATA TATGCCTTCC CCCACTACCG TAAATGTCCA 
3360 

TTTATATCAT TTTTTATATA TTCTTATAAA AATGTAAAAA AGAAAAACAC CGCTTCTGCC 
10 3420 

TTTTCACTGT GTTGGAGTTT TCTGGAGTGA GCACTCACGC CCTAAGCGCA CATTCATGTG 
3480 

GGCATTTCTT GCGAGCCTCG CAGCCTCCGG AAGCTGTCGA CTTCATGACA AGCATTTTGT 
3540 

15 GAACTAGGGA AGCTCAGGGG GGTTACTGGC TTCTCTTGAG TCACACTGCT AGCAAATGGC 

3600 

AGAACCAAAG CTCAAATAAA AATAAAATAA TTTTCATTCA TTCACTCATT TATTGTCAAC 
3660 

ATTTATTGAG CACCTATTAC AACAATTTCA TCGCATGGAA GACAGCATCG TTTCTGACAC 
20 3720 
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TGTTGTTTCA TGTATCTCTT AGAAAAACGC TGCTATTAGA CATCTAACAC TATTTATCTT 
3780 

GAGGTGATAA AATATCAAAA GCCGTGTCTC AAGATCGATG AAATGCGGTT. AAAATGATGA 
3840 

5 ATAGAAACTC TAGGGGGACC TCATATCGAT AGACTCGAGA CTGGCACATC TGGAGATCCG 

3900 

TATTTATCCG GCTTCCGCTT CCAGATCACG CGAGGTTTGG GATATTTTGC TCACCAGGCC 
3960 

TCAGCCAGGT AACTGAATCC AGCCAACCCT GGCCCATAGT CTCGGAATCC GACTCGGCTC 
10 4020 

CCAGTCCCCG CCTCGGCGTT CTGAGACCCC CAGGCTGGGT TCCAAGAGGG CTGTGAGGTT 
4080 

GCGAATGACT GCTGCCAAAC CGGAAGGAAC TCTGCGGTTC TCTGCCACAG TGGGATTGTT 
4140 

15 GCAGGCACGC GGCTCAGACT TCACTGAGGT TGGGAGATGC TCCTGTCCAC GCTGCCTCAT 

4200 

CCCGTGCTGG AGCACTGCAC CTCTATTTTT TTTTTTAGGG TACACGCCAC ATAACATAAA 
4260 

ACTAAAAATT TTAAAGAGTA GAATTC 
20 4286 
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(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: CDK4I5* (genomic exon) 

10 (ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..126 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

ATGGAGCCTT CGGCTGACTG GCTGGCCACG GCCGCGGCCC GGGGTCGGGT AGAGGAGGTG 
15 60 

CGGGCGCTGC TGGAGGCGGG GGCGCTGCCC AACGCACCGA ATAGTTACGG TCGGAGGCCG 
120 

ATCCAG 
126 
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(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 06 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: CDK41 1 (genomic exon) 

10 (ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. ;306 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

GTCATGATGA TGGGCAGCGC CCGAGTGGCG GAGCTGCTGC TGCTCCACGG CGCGGAGCCC 
15 60 

AACTGTGCCG ACCCCGNCAC TCTCACCCGA CCCGTGCACG ACGCTGCCCG GGAGGGCTTC 
120 

CTGGACACGC TGGTGGTGCT GCACCGGGCC GGGGCGCGGC TGGACGTGCG CGATGCCTGG 
180 
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GGCCGTCTGC CCGTGGACCT GGCTGAGGAG CTGGGCCATC GNGATGTCGC ACGGTACCTG 
240 



CGCGCGGCTG CGGGGGGCAC CAGAGGCAGT AACCATNCCC GNATAGATGC CGCGGAAGGT 
300 

5 CCCTCA 
306 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 15 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 
15 (B) CLONE: CDK4I3 1 (genomic exon) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
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GACATCCCCG ATTGA 
15 

(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



10 (vii) IMMEDIATE SOURCE: 

(B) CLONE: CDK4I 1 primer 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .20 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

GGAAATTGGA AACTGGAAGC 
20 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: CDK4 1 1 primer 



(ix) FEATURE: 

(A) NAME/KEY: CDS 
10 (B) LOCATION: 1..20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



CAGGTCATGA TGATGGGCAG 
20 



(2) INFORMATION FOR SEQ ID NO: 8: 



15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 



20 



(ii) MOLECULE TYPE: DNA (genomic) 
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(vii) IMMEDIATE SOURCE: 

(B) CLONE: CDK43 * primer 

(ix) FEATURE: 

(A) NAME /KEY: CDS 
5 (B) LOCATION: 1..20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

CCCGCTTTCG TAGTTTTCAT 
20 

(2) INFORMATION FOR SEQ ID NO: 9: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: CDK43 • primer 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
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(B) LOCATION: 1..20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9 

CAGAACCAAA GCTCAAATAA 
20 

5 (2) INFORMATION FOR SEQ ID NO: 10; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE : nucleic acid 

(C) STRAND EDNESS : single 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: 5BS primer 

(ix) FEATURE: 
15 (A) NAME /KEY : CDS 

(B) LOCATION: 1-.20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
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GCTTAGTTTT AGAGGGTGAT 
20 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



10 (vii) IMMEDIATE SOURCE: 

(B) CLONE: BBS primer 

(ix) FEATURE: 

(A) NAME /KEY : CDS 
<B) LOCATION: 1..20 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

CATCACTCAT AAGAACTGCT 
20 



WO 95/28169 



PCT/US95/0465S 



-94- 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: CDK4I5 1 primer (sense) 

10 (ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..19 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



ACCATGGAGC CTTGGCTGA 
15 19 
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(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: CDK4I5 ' primer ( ant i sense) 

10 (ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..19 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CAATAGTTAC GGTCGGAGG 
15 19 

(2) INFORMATION FOR SEQ ID NO : 14 : 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2763 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Methyl thioadenosine Phosphorylase (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..2763 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

10 TTTATACAGA GCATGACA6T GGGGTCCTCA CTAGGGTCTG TCTGCCACTC TACATATTTG 

60 

AAACAGGAGT GGCTTCTCAG AATCCAGTGA ACCTAAATTT TAGTTTTAGT TGCTCACTGG 
120 

ACTGGGTTCT AGGAGACCCC CTGTGTTAGT CTGTGGTCAT TGCTAGSAGA ATCACTTAAT 
15 180 

TTTTTCTAGA CTCTAGGAGA AAACAGTTGG TGGTGTACTC ATCACGGGTT AACAATTTCT 
240 

TCTCTCCTTC CATAGGCATG GAAGGCAGCA CACCATCATG CCTTCAAAGG TCAACTACCA 
300 
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GGCGAACATC TGGGCTTTGA AGGAAGAGGG CTGTACACAT GTCATAGTGA CCACAGCTTG 
360 

TGGCTCCTTG AGGGAGGAGA TTCAGCCCGG CGATATTGTC ATTATTGATC AGTTCATTGA 
420 

5 CANNNNNNNN NNNNNNNNNN GAGGTCGACG GTATCGATAA GCTTTGTAAA CAATTGTCTT 

480 

TAGCTTATCC AGAGGAATTG AGTCTGGAGT AAAGACCCAA ATATTGACCT AGATAAAGTT 
540 

GACTCACCAG CCCTCGGAGG ATGGAAAGAT GGCCTTAAAA TAAAACAAAC AAAAACCTTT 
10 600 

TTTGCTTTAT TTTGTAGGAC CACTATGAGA CCTCAGTCCT TCTATGATGG AAGTCATTCT 
660 

TGTGCCAGAG GAGTGTGCCA TATTCCAATG GCTGAGCCGT TTTGCCCCAA AACGAGAGAG 
72 0 

15 GTGTGTAGTC TTTCTGGAAG GTGTACCAGA ATAAATCATG TGGGCTTGGG GTGGCATCTG 

780 

GCATTTGGTT AATTGGCAGA CGGAGTGGCC CCATACCCTC ACTCAAGTTT GCTTTGTATT 
840 



20 



ATGCAAGTTT ATGGAGAGTT ATTTCCTGTT GCTAATAATT TNNNNNNNNN NNNNNNNNNN 



900 
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AAGTGCAGCC TTAAGTTGTG CATGTGCTAG TATGTTTTGA AGTTTCTGGT TTTTCTTTTC 
960 

TAGGTTCTTA TAGAGACTGC TAAGAAGCTA GGACTCCGGT GCCACTCAAA GGGGACAATG 
1020 

5 GTCACAATCG AGGGACCTCG TTTTAGCTCC CGGGCAGAAA GCTTCATGTT CCGCACCTGG 

1080 

GGGGCGGATG TTATCAACAT GACCACAGTT CCAGAGGTGG TTCTTGCTAA GGAGGCTGGA 
1140 

ATTTGTTACG CAAGTATCGC CATGGGCACA GATTATGACT GCTGGAAGGA GCACGAGGAA 
10 1200 

GCAGTAGGTG GAATTCTTTT CTAAGCACAT ATAGCATGGG TTTCTGGGTG CCAATAGGGT 
1260 

GTCTTAACTG TTTGTTTCTA TTACGTTAGT TTCAGAAAGT GCCTTTCTAC AAGGTTTTGA 
1320 . 

1 5 AGTTGTTAAT ATTTTCTGTA GTTCCATTGG AAGGTAAGAA CAAAGATCAA AAGAAAGAAA 

1380 

GAGACACTTT TACCCAAGGA TCAGTAGTGA AAATAGTACA TTGTAGGCAT GTAGATGTGT 
1440 

TGAGAATCAT ACTAAGACTT GGGCCTTANN NNNNNNNNNN NNNNNNNNNN NNTACCCTAC 

20 1500 
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ATTGAGGATT CGGTTTCAGC AGATAAATTT 
1560 

GCTGGAGCTC AGAAAAATGT TTTATGACAA 
1620 

5 AGTGCTATTG TTTCTCTAGG TTTCGGTGGA 

1680 

TAATAAAGCC AAAAGCTTAC TGCTCACTAC 
1740 

AGAAACCCTC CATAACCTGA AGGTAAGTGC 
10 1800 

TCTCTATTGT CTTCTTTTCT TACTTGCATT 
1860 

GCCTAGATGT TTTCAACAAG TTTTTGTGAC 
1920 

15 AACTGAGTAG TCTTATTTTC TTGGCTGGTA 

1980 

ATAATCCAGG CTGGGCTGGT ATGGCAATAA 
2040 

CATTATTAAC CTCACTTTAC AGGAAAGGGA 



GAGGGACACA AACATTTAGG CTGTAGCAAG 



GCAGTGGAAT TTTAAGTTCT AGTAACCTCC 



CCGGGTCTTA AAGACCCTGA AAGAAAACGC 



CATACCTCAG ATAGGGTCCA CAGAATGGTC 



AGCCATGGAC AATCAGGCAT GTCTGTAGAC 



TCACCTTTGG TCCTCATGTA TTTTTTGCCA 



ATCTACTACT ACCATACCAA CCACTTGTGA 



GTGCAGANNN NNNNNNNNNN NNAATAAACA 



GTGATTATCA GAACAATGCT CTGAGATAAG 



GGTGAGGAAC CAAGAGTTTA GAGTACCCGA 



20 



2100 
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AGTTCCACAT CTGGTTAGTG AACTTGAAAA TTTTCTGTAG AATTTATTTA AAGTGTATGT 
2160 

TTCCTGCGTC CTCACTTTGA TCTAGAAAAT CAAAATCTGT TTTTTTTTTT AACAAACATC 
2220 

5 TCAGTAATTA CGCCAACATG TGAATATCAC TGCCTCCTTT CTTCCTTTCA GAATATGGCC 

2280 

CAGTTTTCTG TTTTATTACC AAGACATTAA AGTAGCATGG CTGCCCAGGA GAAAAGAAGA 
2340 

CATTCTAATT CCAGTCATTT TGGGAATTCC TGCTTAACTT GAAAAAAATA TGGGAAAGAC 
10 2400 

ATGCAGCTTT CATGCCCTTG CCTATCAAAG AGTATGTTGT AAGAAAGACA AGACATTGTG 
2460 

TGTATAGAGA CTCCTCAATG ATTTAGACAA CTTCAAAATA CAGAAGAAAA GCAAATGACT 
2520 

15 AGTAACATGT GGGAAAAAAT ATTACATTTT AAGGGGGAAA AAAAACCCCA CCATTCTCTT 

2580 

CTCCCCCTAT TAAATTTGCA ACAATAAAGG GTGGAGGGTA ATCTCTACTT TCCTATACTG 
2640 

CCAAAGAATG TGAGGAAGAA ATGGGACTCT TTGGTTATTT ATTGATGCGA CTGTAAATTG 
20 2700 



WO 95/28169 



PCT/US95/04655 



-101- 

GTACAGTATT TCTGGAGGGC AATTTGGTAA AATGCATCAA AAGACTTAAA AATACGGACG 
2760 

TAC 



2763 
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CLAIMS 



1. An isolated polynucleotide which will encode CDK4I or a biologically 
active fragment thereof. 

2. A polynucleotide according to Claim 1 having the nucleotide sequence 
5 contained in SEQUENCE ID Nos. 1-2. 

3. A recombinant expression vector containing at least one of the 
polynucleotides of Claim 1. 

4. CDK4I or biologically active fragments thereof expressed by the 
recombinant expression vector of Claim 3. 

10 5. Isolated, substantially pure CDK4I or biologically active fragments thereof. 

6. An isolated polypeptide having an amino acid sequence expressed by at 
least one of the polynucleotide sequences contained in SEQUENCE ID 
Nos. 3-5. 

7. Isolated, functional variants of the polypeptide of Claim 6. 

15 8. The polynucleotide according to Claim 2 wherein the polynucleotide 

contains a polymorphism. 



9. 



The polynucleotide according to Claim 8 wherein the polymorphism 
consists of the deletion or substitution of at least one base pair. 
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10. 
11. 

5 

12. 

13. 

10 

14. 

15. 
16. 

15 

17. 
18. 

20 
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The polynucleotide according to Claim 9 wherein the cytosine at position 
166 of the mRNA corresponding to SEQUENCE ID.Nos.: 1-2 is 
substituted with thymine. 

The polynucleotide according to Claim 9 wherein the deletion includes a 
deletion of the base pairs from about position 1 80 to position 1 98 of 
SEQUENCE ID.No.: 1. 

Oligonucleotides which will specifically hybridize to the polynucleotides of 
Claim 1 . 

Oligonucleotides which will specifically hybridize to the polynucleotides of 
Claim 2. 

Antibodies which will specifically bind CDK4I or biologically active 
fragments thereof. 

Antibodies which will specifically bind the polypeptide of Claim 6. 

Antibodies which will specifically bind at least one of the polypeptides of 
Claim 7. 

A pharmaceutical composition containing substantially pure CDK4I or 
biologically active fragments thereof and a pharmaceutically acceptable 
carrier. 

A pharmaceutical composition comprising the polypeptide of Claim 4 in 
substantially pure form and a pharmaceutically acceptable carrier. 
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A pharmaceutical composition comprising the polypeptide of Claim 6 in 
substantially pure form and a pharmaceutical^ acceptable carrier. 

A pharmaceutical composition comprising at least one of the polypeptides 
of Claim 7 in a substantially pure form and a pharmaceutically acceptable 
carrier. 

A method for diagnosing a cancer condition in a human comprising 
detecting all or a polynucleotide encoding all or part of CDK4I in a 
biological cell sample from the human. 

The method according to Claim 21 wherein the biological cell sample 
comprises one or more somatic cells. 

The method according to Claim 21 wherein the biological cell sample 
comprises one or more germline cells. 

The method according to Claim 21 wherein the cancer condition 
comprises melanomas, gliomas, non-small cell lung cancers and 
leukemias. 

The method according to Claim 21 wherein the method further comprises 
detecting a polynucleotide which encodes all or a part of MTAse. 

The method according to Claim 21 wherein the PGR is used to amplify all 
or a part of the gene for CDK4I, if present, in the biological cell sample. 
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27. The method according to Claim 26 wherein the PCR is competitive PCR. 

28. The method according to Claim 27 wherein any CDK4I polynucleotide is 
detected by ELISA. 

29. A method for diagnosing a cancer condition in a human comprising 
5 detecting CDK4I in a biological cell sample from the human which sample 

is suspected of containing premalignant or malignant cells. 

30. The method according to Claim 29 wherein the CDK4I is detected by 
immunoassay. 

31 . A method for determining susceptibility to a cancer condition comprising 
10 detecting polymorphisms in the gene for CDK4I which polymorphisms 

cause a reduction in the biological activity of CDK4I. 

32. The method according to Claim 31 wherein the biological cell sample 
comprises one or more somatic cells. 

33. The method according to Claim 31 wherein the biological cell sample 
1 5 comprises one or more germline cells. 

34. The method according to Claim 31 wherein the cancer condition 
comprises melanomas, gliomas, non-small cell lung cancers and 
leukernias. 
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The method according to Claim 31 wherein the method further comprises 
detecting a polynucleotide which encodes all or a part of MTAse. 

The method according to Claim 31 wherein the PCR is used to amplify all 
or a part of the gene for CDK4I, if present, in the biological cell sample. 

The method according to Claim 36 wherein the PCR is competitive PCR. 

The method according to Claim 37 wherein any CDK4I polynucleotide is 
detected by ELISA. 

The method according to Claim 31 wherein the polymorphism consists of 
a nonsense substitution of at least one base pair in a CDK4I gene exon. 

The method according to Claim 39 wherein the cancer condition is 
dysplastic nevus syndrome. 

The method according to Claim 31 wherein the polymorphism consists of 
a deletion of at least one base pair in a CDK4I gene exon. 

The method according Claim 41 wherein the cancer condition is leukemia. 
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43. A method for inhibiting CDK4 activity in human cells in need of such 
inhibition comprising administering a therapeutically effective amount of 
CDK4I to a human. 

44. A method for inhibiting CDK4 activity in human cells in need of such 
5 inhibition comprising administering a polynucleotide which encodes CDK4I 

to a human in a form wherein the gene will express CDK4I in vivo in the 
human cells. 

45. A method according to Claim 43 wherein the polynucleotide will replace 
any wild-type CDK4I in the human cells. 

10 46. A method according to Claim 43 wherein the human cells contain a wild- 
type polynucleotide of SEQUENCE ID Nos. 1-2 which polynucleotide 
contains a polymorphism, wherein the method further comprises 
administering at least one antisense polynucleotide to the human, which 
antisense polynucleotide will inhibit the expression of an mRNA 

1 5 transcribed from the wild-type polynucleotide. 

47. A kit for use in performing the method according to Claim 21 comprising 
reagants and reactants useful in the method. 

48. A kit for use in performing the method according to Claim 31 comprising 
reagants and reactants useful in the method. 

20 49. A kit for use in performing the method according to Claim 43 comprising 

reagants and reactants useful in the method. 
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[received by the International Bureau on 21 September 1995 (21.09.95); 
original claims 6, 7, 20 and 41 cancelled; original claims 1-5 amended; 
claims 9-13, 15, 16, 19, 21-28, 30-40 and 42 amended and renumbered as claims 

7-11. 13, 14, 17, 18-25, 27-37 and 38; 
remaining claims unchanged but renumbered accordingly (5 pages)] 

1 . An isolated CDK4I polynucleotide. 

2. A polynucleotide according to Claim 1 consisting essentially of at least one of the 
CD4KI exons whose nucleotide sequences are contained in SEQUENCE ID Nos. 
3-5. 

3. A recombinant expression vector which operatively encodes at least one of the 
polynucleotides of Claim 2. 

4. CDK4I or fragments thereof containing a specific binding site for CDK4, 
expressed by the recombinant expression vector of Claim 3. 

5. Isolated, substantially pure CDK4I or peptide fragments thereof which contain a 
specific binding site for CDK4. 

6. The polynucleotide according to Claim 2 wherein the polynucleotide contains a 
polymorphism. 

7. The polynucleotide according to Claim 6 wherein the polymorphism consists of the 
deletion or substitution of at least one nucleotide. 
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8. A CDK4I polynucleotide wherein the nucleotide sequence of the polynucleotide 
consists of SEQUENCE ED. No. 4 and the cytosine at position 166 is substituted 
with thymine. 

9. A CDK4I polynucleotide wherein the nucleotide sequence of the polynucleotide 
consists of SEQUENCE ID. No. 4 and the nucleotides from about position 180 to 
position 198 are deleted. 

10. Oligonucleotides which specifically hybridize to the polynucleotides of Claim 1. 

1 1 . Oligonucleotides which specifically hybridize to the polynucleotides of Claim 2. 

12. Antibodies which will specifically bind CDK4I or biologically active fragments 
thereof. 

13 . Antibodies which will specifically bind the polypeptide of Claim 4. 

14. Antibodies which will specifically bind at least one of the polypeptides of Claim 
5. 

15. A pharmaceutical composition containing substantially pure CDK41 or biologi- 
cally active fragments thereof and a pharmaceutically acceptable carrier. 
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16. A pharmaceutical composition comprising the polypeptide of Claim 4 in 
substantially pure form and a pharmaceutically acceptable carrier. 



17. A pharmaceutical composition comprising a polypeptide of Claim 5 in 
substantially pure form and a pharmaceutically acceptable carrier. 

18. A method for diagnosing a cancer condition in a human comprising detecting a 
CDK4I polynucleotide in a biological cell sample from the human. 

19. The method according to Claim 18 wherein the biological cell sample comprises 
somatic cells. 



20. The method according to Claim 18 wherein the biological cell sample comprises 
germline cells. 



21. The method according to Claim 18 wherein the cancer condition comprises 
melanomas, gliomas, non-small cell lung cancers and leukemias. 

22. The method according to Claim 18 wherein the method further comprises detecting 
a polynucleotide which encodes MTAse. 



23. The method according to Claim 18 wherein the PCR is used to amplify CDK4I 
polynucleotide, if present, in the biological cell sample. 
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24. The method according to Claim 23 wherein the PCR is competitive PCR. 

25. The method according to Claim 24 wherein any CDK4I polynucleotide present in 
the biological cell sample is detected by ELIS A. 

26. A method for diagnosing a cancer condition in a human comprising detecting 
CDK4I in a biological cell sample from the human which sample is suspected of 
containing premalignant or malignant cells. 



27. The method according to Claim 26 wherein the CDK4I is detected by immunoas- 
say. 



28. A method for detennming susceptibility to a cancer condition comprising detecting 
CDK4I polynucleotide polymorphisms, which polymorphisms disable coding of 
CDK4I protein with binding affinity for CDK. 



29. The method according to Claim 28 wherein the biological cell sample comprises 
somatic cells. 



30. The method according to Claim 28 wherein the biological cell sample comprises 
germline cells. 



31. The method according to Claim 28 wherein the cancer condition comprises 
melanomas, glioms, non-small cell lung cancers and leukemias. 
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The method according to Claim 28 wherein the method further comprises detecting 
a polynucleotide which encodes MTAse. 

The method according to Claim 28 wherein the PCR is used to amplify CDK4I 
polynucleotide, if present, in the biological cell sample. 

The method according to Claim 33 wherein the PCR is competitive PCR. 

The method according to Claim 34 wherein any CDK4I polynucleotide present in 
the biological sample is detected by ELISA. 

The method according to Claim 28 wherein the polymorphism consists of a 
nucleotide substitution or deletion in a CDK4I gene exon. 

The method acording to Claim 36 wherein the cancer condition is dysplastic nevus 
syndrome. 

The method according to Claim 36 wherein the cancer condition is leukemia. 
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FIG. 1 
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FIG. 2a 



1 TTTGGGGNNA AGTmSGGAA AANCCAATCC TCCTTCCTTT CCAACIWTGC 
51 TTCTGGCGAG QCTCCTTCCC GGCTTGTTCC CCCNGGGGGA AGACCCAACC 
101 TGGGCCGACC TTCAGGGTTC CCACATTCCC TAANTGCTCG GAGTTAATAN 
151 CACCTCCTCC GAGNACTCGC TCACGNCGTC CCCTTNCCTC GAAAGATACC 
201 GCGNTCCCTC NAGAGGAHT GAGGGACAGG GTCGGAGGGG IOCTTCCGC 
251 CAGCACCGGA GGAAGAAAGA GGAGGGGCTG GCTGGTCACC AGAGGGTGGG 
301 GCGGACCGCG TGCGCTCGGC GTCTGCGGAG AGGGGGAGAG CAGGCAGCGG 
351 GCGGCGGGGA GCAGCATGGA GCCGGCGGCG GGGAGCAG CA TGGAGCCTTC 
401 QGCTGACTSG C TCGCCACGG CC6CGQCCCQ GGGTCGGGTA GAGGAGGTCC 
451 QGGCGCTGCT GGAG GCGGGG GCGCTGCCCA ACGCACCGAA TACTTAfrar 
501 CGGAGGCCGA TCCAGGTGGG TAGAGGGTCT GCAGCGGGAG CAGGGGATCG 
551 CGGGCGACTC TSGAGGACGA AGTTTGCAGG GGAATTGGAA TCAGGTAGCG 
601 CTTCGATTUT CCGGAAAAAG GGGAGGCTTC CTGGGGAGTT TTCAGAAGGG 
651 GTTTGTAATC ACAGACCTCC TCCTGGCGAC GCCCTOGGGG CTTGGGAAGC 
701 CAAGGAAGAG GAATNAGGAG CCACGCGCGT ACAGATCTCT CGAATGCTGA 
751 SAMGA7YTTR AGGGSSGRAN ATATTTGTAT TCAGATGGAA GTATKCTCTT 
801 TATCAGATAC AAAATTTACG AACGTTTGGG ATAAAAAGGG AGTCTTAAAG 
851 AAATKTAAGA TCTKCTOGGA CTACTTAGCC TCCAATTCAC AGATACCTGG 
901 ATGGAGCTTA TCT7TCTTAC TAGGAGGGAT TATCAGTGGA AATCTGTGGN 
951 GTATGT7GGA ATAAATATCG MTATAAATT TTGATCGAAA TTATTCAGAA 
1001 GCGGCCGGGC GCGGTGCCTC ACGCCT7GTA ATCCCTTCAC TTTGGGAGAT 
1051 CAAGGCGGGG GGGAATCANC TGAGGTCGGG AGTTCGAGAA CAGCCTGGGC 
1101 AACAGG7GAA AACCTCGCCT CCTACTAAAA AATACAAAAA GTAGNC 
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FIG. 2b(l) 



1 GAATTCATTG TGTACTGAAG AATGGATA6A GAACTCAAGA AGGAAATTGG 
51 AAACTGGAAG CAAATGTAGG GGTAATTAGA CACCTGGGGC TTGTCTGGGG 
101 GTCTGCTTGG CGGTCAGGGG GCTCTACACA AGCTTCCTTT CCGTCATCCC 
151 GGCCCCCACC CTGGCTCTGA CCATTCTGTT CTCTCTGGCA G GTCATGATG 
201 ATGGGCAGCG CC C6ASTSSC GGAGCTGCTS CTCCTCCACG GCGCGSAGCC 
251 CAACTCTBCC GACCCC6NCA CTCTC ACCCG ACCCCTBCAC GACGCTGCCC 
301 GSGASGGCTT C CTBGACAC6 CTBGTGGTCC TCCA CCGBGC rCRfifiPflrcc 
351 CTGGACGTGC G CGATCCCTG GGGCCGTCTG CCCGTGGACC TGGCTSAGfiA 
401 GCTSGGCCAT C6NGATGTCG CACGR TACCT GCGCGCGGTT GCGBGGGSCA 
451 CCA6A6GCAG T AACCATWCC CfiNATAGATC CCGCGGAAGG TCCCTCAGS T 
501 GAGGACTGAT GATCTNAGAA TTTCNCCCCT GAGAGCTTCC AAAGCTCAGA 
551 GMATTCATTT TCCAGCACAG AAAG7NCAGC CCGGGAGANC AGTCTCCGGT 
601 CTTGTCTCAG CTCACGCGCC AATCGGTGGG ACGGCCTGAG TCTCCCTATC 
651 GCCCTGCCCC GCCAGGGCGG CAAATGGGAA ATAATCCCGA AATGGACTTG 
701 CGCACGTGAA AGCCCATTTT GTACATTATA CTTCCCAAAG CATACCACCA 
751 CCCAAACACC TACCCTCTGC TAGTTCAAGG CCTAGACTGC GGAGCAATCA 
801 AGACTCAAGA GGCTAGAGGT CTAGTGCCCC CTCTTCCTCC AAACTAGGGC 
851 CAGTTGCATC CACTTACCAG GTCTGTTTCC TCATTTGCAT ACCAAGCTGG 
901 CTCGACCAAC CTCAGGATTT CCAAACCCAA TTGTGCGTGG CATCATCTBG 
951 AGATCTCTCG ATCTCGGCTC TTCTGCACAA CTCAACTAAT CTGAACCTCC 
1001 TCAGCTAATC TGACCCTCCG CTTNATGCGG TAGAGT7TAC CAGAGCTGCC 
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FIG. 2b(2) 

1051 CCAGGGGGTT C1T3GGGACAT CAGGACCMQ ACTTCGCT5A CCCTGGCAGT 

1101 c™™™ QMrrrescrc ctttccctct taaacttctg caagagatcg 

1151 "S^TG AAGGTAGAAT TATCGTCCTC CTTGCCC7NG CCTTTCCTTT 
1201 TAGTSATCTC AAAGCATCCT CCCTCCGTCC CCATTCCATG SCCCCAGTTC 
1251 ACTACTCCCA CAGCTGTCTC STGAAACTSA CAACATTACT CMTTCTTTC 
1301 TGGGGGGAGG AACATTTTTT TTTGAAACAA AATAGATATA TCAAACAGTA 
1351 CACGGGAATT AACAC6ATTA TTTAAGGTAA AACATQACCT T6AA6A7TAT 
1401 GAAATCCATC TTATTTTSSC CCAGAACG88 GGCATTGGKC TCCTTBSCCC 
1451 ATAGGGGAGC TO6GGA0GAC AQGGTGAAGA GTTAGCTCTA AGCCCItrnW 
1501 TTGGAfiATGC TGTAAATACA GAACGCAAAA TCACCTTCGA AGTTAAAGAC 
1551 GCGAASTTCT TCTJTACTCG GCCCCTCCTC CCCTCCCCCC CGACMTTCC 
1601 CTCCAGTTAC AGCTAGCATC CAGGTCCCGG GAGGTGAAGA AG6A6ACTTC 
1651 GGCTCCAS7T ACAGCTAGCA TCCGGGTCCC 6ATTTASAA6 GAGCTGCCAA 
1701 TTACAGCGCG GTTCCAGGGC TSAQCAAAAA GOTGAGGAG CCAAGTCGGA 
1751 GAGGGASTAA AACTACTSAA TTCGGCCACA ASCAAATUAA TAAACTCAAC 
1801 GACTCTTAAC CAAACCTAAT ATATTTAATC CAAACACACA AGTCTTTCAT 
1851 TTCTTCCCTC CTCCCTTCCT TCTCTTACTC CCCAACACCC CCTCTTCAAG 
1901 CACAATTAAT TATATSGTTA GATTCTACTC CGTGATCAGC CCTSTTCTAG 
1951 GT6GTCGGCA CQCCAAGGTG AATGAGACCA AACAAGAGTC TTGCCCTCAT 
2001 GGGGTTTACA TTTGGAGACA GAGTCGATCT GTTGCCCAAC CTGGA6TBCA 
2051 GTGGCGCGAT CACAGCTCAC TGCAGCCTCA AACTCCCTCG CTtAAGGaGT 
2101 TCTCOCACCT GAGCCTCCCG ACTAGCTGGG ACCACAGGTG CACGCCACGA 

2i5i cgcctcgutt TGnrcrrre tttaatagag acgaaggtct caccatctta 
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2201 


TCTQGGCTCA AGCGATCATC 


CCCCCTCCTC 


CTCCTAAAGT 


ACTGGGATTA 


2251 


CAGTCCCAAG CTAICI ItiCC 


CGACCTGGGA 


AACAGACGTT 


AAGGAAGATA 


2301 


ACAATCTATT TTCAGAGAGC 


GAGTTTATAA 


AACCAATGCA 


ATGGGTAAAT 


2351 


ATGAAGTGTG AATAGGAGGA 


GAAGCTAAAG 


AGTGGTCGGA 


GAATCTAATG 


2401 


CAAGCTACGG GAGAAAGAAA 


CTCAAGTGCA 


AATGCTGCCT 


CAGGAATAAA 


2451 


CGTAAAAAGA GACN ICAAG 


TGCAAATGCT 


CCCTCAGGAA 


TAAAATAATC 


2501 


TTGAGACTCT CAAGTGTAAA 


TGCTGCCTCG 


GGAGAACCGA 


ACGGCGAGCT 


2551 


GGAGCCCATA CGCAACGAGA 


TTAGAGAGGA 


AGGCAGAAGC 


CAGAGCACAT 


2601 


GAATAAATGA GCATCCATTT 


IGIIICAGAA 


ATGATCGGAA 


ACCAI 1 IGIG 


2651 


GGTTTGTAGA AGCAGGCATG 


CGTAGGGAAG 


CTACGGGATT 


CCGCCGAGGA 


2701 


GCGCCAGAGC CT6AGGCGCC 


CTTTGGTTAT 


CGCAAGCTGG 


CTGGCTCACT 


2751 


CCGCACCAGG TGCAAAAGAT 


GCCTGGGGAT 


GCGGGAAGGG 


AAAGGCCACA 


2801 


TCTTCACGCC TTCGCGCCTG 


GCATTGIGAG 


CAACCACTGA 


GACTCATTAT 


2851 


ATAACACTCG IIIIC1ICII 


GCAACCCTGC 


GGGCCGCGCG 


GTCGCGCTTT 




CTCTGCCCTC CGCCGGGTGG 


ACCTGGAGCG 


CTTGAGCGGT 


CGGCGCGCCT 


2951 


GGAGCAGCCA GGCGGGCAGT 


GGACTAGCTG 


CTGGACCAGG 


GAGGTGTGGG 


3001 


AGAGCGGTGG CGGCGGGTAC 


ATGCACGTGA 


AGCCATTGCG 


AGAACTTTAT 


3051 


CCATAAGTAT TTCAATGCCG 


GTAGGGACGG 


CAAGAGAGGA 


GGGCGGGATG 


3101 


TTCCACACAT CrilGACCTC 


AGGTTTCTAA 


CGCCIGIIII 


CTTTCTGCCC 


3151 


TCTGCAGACA TCCCCGATTG 


AAAGAACCAG 


AGAGGCICIti 


AGAAACCTCC 


3201 


GGAAACTTAG ATCATCAGTC ACCGAAGGTC 


CTACAGGGCC 


ACAACTGCCC 


3251 


CCGCCACAAC CCACCCCGCT TTCGTAGTTT 


TCATTTAGAA 


AATAGAGCTT 


3301 


TTAAAAATGT CCTSCCTTTT AACGTAGATA 


TATGCCTTCC 


CCCACTACCG 
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FIG. 



3351 TAAATGTCCA TTTATATCAT 

3401 AGAAAAACAC CGCTTCTGCC 

3451 GCACTCACGC CCTAAGCGCA 

3501 CAGCCTCCGG AAGCTGTCGA 

3551 AGCTCAGGGG GGTTACTGGC 

3601 AGAACCAAA6 CTCAAATAAA 

3651 TATTGTCAAC ATTTATTGAG 

3701 GACAGCATCG TTTCTGACAC 

3751 TGCTATTAGA CATCTAACAC 

3801 GCCGTGTCTC AAGATCGATG 

3851 TAGGGGGACC TCATATCGAT 

3901 TATTTATCCG GCTTCCCCTT 

3951 TCACCAGGCC TCAGCCAGGT 

4001 CTCGGAATCC GACTCGGCTC 

4051 CAGGCTGGGT TCCAAGAGGG 

4101 CGGAAGGAAC TCTGCGGTTC 

4151 GGCTCAGACT TCACTGAGGT 

4201 CCCGTGCTGG AGCACTGCAC 

4251 ATAACATAAA ACTAAAAATT 



2b(4) 



TTTTTATATA 


TTCTTATAAA 


AATGTAAAAA 


TTTTCACTGT 


GTTGGAGTTT 


TCTGGAGTGA 


CATTCATGTG 


GGCATTTCTT 


GCGAGCCTCG 


CTTCATGACA 


AGCAIIIIGI 


GAACTAGGGA 


TTCTCTTGAG 


TCACACTGCT 


AGCAAATGGC 


AATAAAATAA 


TTTTCATTCA 


TTCACTCATT 


CACCTATTAC 


AACAATTTCA 


TCGCATGGAA 


TGTTGTTTCA 


TGTATCTCTT 


AGAAAAACGC 


TATTTATCTT 


GAGGTGATAA 


AATATCAAAA 


AAATGCGGTT 


AAAATGATGA 


ATAGAAACTC 


AGACTCGAGA 


CTGGCACATC 


TGGAGATCCG 


CCAGATCACG 


CGAGGTTTGG 


GATATTTTGC 


AACTGAATCC 


AGCCAACCCT 


GGCCCATAGT 


CCAGTCCCCG 


CCTCGGCGTT 


CTGAGACCCC 


CTGTGAGGTT 


GCGAATGACT 


GCTGCCAAAC 


TCTGCCACAG 


TGGGAI Itil 1 


GCAGGCACGC 


TGGGAGATGC 


TCCTGTCCAC 


GCTGCCTCAT 


CTCTATTn 1 


IIIIIIAGGG 


TACACGCCAC 


TTAAAGAGTA 


GAATTC 
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FIG. 3 



5 CGGCACGAGGCAGCATGGAQCCTTCGGCTSACTGGCrrGGCCACGGCC^G 54 
lilt I 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 • 1 1 1 1 
376 CGGC6B6GAfi(^aCATBGAGCCTTCGGCTCAC^ 425 

55 GCCCGGGGTTCGGOTAGAQGAG(^GGGCkTGCTGGAGKGGTOGC^ 104 
lllllllllllllllll llllllllllllllllllllllllll 1 1 1 1 1 1 
426 GCCCGGGCTCGGOTAGAGGAGGTTGC^GGCGCTGCTGGAGGCGGGGGCKT 475 

105 GCCt^GCACCGAATAGTTACGGTCGGAGGCCGATCCAGGT 146 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
476 GCCCAACGCACCGAATAGTTACGGnrCGGAGGCCGATCCAGGT 517 



FIG. 4a 



189 CAGG^CATGAl^TGGGCAfiCGCCCGASTGGCGGAGCTBCT 238 

1 1 1 1 1 1 1 1 1 ■ t ■ 1 1 1 1 1 1 ■ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ■ 1 1 1 1 1 1 1 1 ■ 1 1 1 1 1 1 
142 CAl^CATGATG&TGGGK^G^ 191 

239 C(^GCGGAG(XCAACTCT^ 288 

iitiiiiiiiiiiiiiii 1 : 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 ■ i ■ 1 1 1 1 1 ■ 1 1 1 
192 CGGCGCGGAGCCCMCTGCGCCGACCCCGCCACTCTCACCCGACCCGnPGC 241 

289 ACGACQCTQCCCGQGAGGGCTTCCTGGACACGCTQGTO^ 338 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : i : ■ 1 1 

242 ACGACGCTGCCCGGGAGGGCmCTGGACACGCTGGTOGnH^ 291 

339 GCCGGGGCGCGGGrFGGACGTNCGCGAATNC^^ 388 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 : 1 1 1 1 1 1 1 : i ■ ■ 1 1 mi:i 
292 GCCGGGGCGCGGCrGGACGrTGCGCG . ATGCCTGGGGCCGTCTGCCCGTGG 340 

389 ACCTGGOTT^GftGCT^ 438 

Mini: i : i : i i 1 1 1 11:11111:1 1 1 : 1 1 1 : 1 1 1 1 1 n n : i n 
341 ACCTGGC . TGAG6AGCTGGGCCATCGCG . ATGTCGCACGGTACCTGCGCG 388 

439 CNGTTNGGGGGGG. .ACANAGGNAGGAACNATNCCC 472 

i:i i: nun ii:ni:n im:ii:iii 
389 CGGCTGCGGGGGGCACCAGAGGCAGnTAACCATGCCC 424 
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FIG. 6 



18 GATCATGGGCAGCGCCTGAGTGGCGGAGCreCTO^^ 67 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ll 1 1 1 1 1 1 1 1 1 
150 GATGATG^AGCGCCCGAGnTGGCGGAGCTGCTGCTGCTCCACGGCK 199 

68 AGCCCAAC 75 

1 1 1 1 1 1 1 1 
200 AGCCCAAC 207 



FIG. 7 



404 MTTCGGCACGAGGCAGCATGGAGCCTTCGGCTGACTGGCTGGCCACGGC 453 
I I I I I I I I 1 1 1 1 1 1 1 I I I I M I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 MTTCGGCACGAGGCAGCATGGAGCCTTCGGCTGACTGGCTGGCCACGGC 50 

454 CGCGGCCCGGGGTCGGGTAGAGGAQGTGCG! '. GG 485 

ill ll ilinilllllll lllliilll I M II 
51 CGCGGCCCGGGGTCGGGTAGAGGAGGTGCGGGCGCTGCTGGAGGCGCTGG 100 

486 CGCTGCCCAACGCACCGAATAGnTACGGTCGGAGGC 521 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 (l 1 1 1 1 1 1 1 1 
101 CGCTGCCCAACGCACC6AATAGTTACGGTCGGAGGC 136 
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FIG. 70 

1 TTTATACAGA GCATGACAGT GGGGTCCTGA CTAGGGTCTG TCTGCCACTC 

51 TACATATTTS AAACAQGAGT GGCTTCTCAG AATCCAGTGA ACCTAAATTT 

101 TAGTTTTAGT TGCTCACTGG ACTGGGTTCT AG6AGACCCC CTGTSTTAGT 

151 CTGTGGTCAT TGCTAGSAGA ATCACTTAAT TTTTTCTAGA CTCTAGGAGA 

201 AAACAGTTGG TGGTGTACTC ATCACGGGTT AACAATTTCT TCTCTCCTTC 

251 CATAGGCATG 6AAG6CASCA CACCATCATG CCTTCAAAGG TCAACTACCA 

301 GGCGAACATC TGGGCTTTSA AGGAASAGGS CTGTACACAT GTCATAGT6A 

351 CCACAGCTTG TGGCTCCTre AGGGAGGAGA TTCAGCCCGG CGATATTGTC 

401 ATTATTGATC AGTTCATTGA CA NNNNNNNN NNNNNNNNNN GAGGTCGACG 

451 GTATCGATAA GCTTTGTAAA CAATTGTCTT TAGCTTATCC AGAGGAATTG 

501 AGTCFGGAGT AAAGACCCAA ATATTGACCT AGATAAAGTT GACTCACCAG 

551 CCCTCGGAGG ATGGAAAGAT GGCCTTAAAA TAAAACAAAC AAAAACCTTT 

601 TTTGCTTTAT TTTG TAGGAC CACTATGAGA CCTCAGTCCT TCTATGATGG 

651 AAGTCATTCT TGTGCCAGAG GAGTGTGCCA TATTCCAATG GCTGAGCCGT 

701 TTTGCCCCAA AACGAGAGAG GTGTGTAGTC TTTCTGGAAG GTGTACCAGA 

751 ATAAATCATG TGGGCTTGGG GTGGCATCTG GCATTTGGTT AATTGGCAGA 

801 CGGAGTQGCC CCATACCCTC ACTCAAGTTT GCTTTGTATT ATGCAAGTTT 

851 ATGGAGAGTT A TTF C CTGTT GCTAATAATT TNNNNNNNNN NNNNNNNNNN 

901 AAGTGCAGCC TTAAGTTGTG CATGTGCTAG TATGTTTTGA AGTTTCTGGT 

951 1 1 HUM I It TA GGTTCTTA TAGAGACTGC TAAGAAGCTA GGACTCCGGT 

1001 GCCACTCAAA GGGGACAATG GTCACAATCG AGGGACCTCG TTTTAGCTCC 

1051 CGGGCAGAAA GCTTCATGTT CCGCACCTGG GGGGCGGATG TTATCAACAT 

1101 GACCACAGTT CCAGAGGTGQ TTCTTGCTAA G6AGGCTGGA ATTTGTTACG 

1151 CAAGTATCGC CATGGGCACA GATTATGACT GCTGGAAGGA GCACGAGGAA 

1201 GCAGTAGGTG GAATTCTTTT CTAAGCACAT ATAGCATGGG TTTCTGGGTG 

1251 CCAATAGGGT GTCTTAACTG TTTGTTTCTA TTACGTTAGT TTCAGAAAGT 

1301 GC CIIIC I A C AAGGTTTTGA AGTTGTTAAT ATTTTCTGTA GTTCCATTGG 

1351 AAGGTAAGAA CAAAGATCAA AAGAAAGAAA GAGACACTTT TACCCAAGGA 

1401 TCAGTAGTGA AAATAGTACA TTGTAGGCAT GTAGATGTGT TGAGAATCAT 
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